NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF



Government Funding 

This work was funded, in part, by NIH grants DK 50795 and DK 52071. The
government may, therefore, have certain rights to this invention.

Background of the Invention 

Hepatitis C virus (HCV) is closely related to both the pestivirus and flavivirus
genera in the Flaviviridae family. HCV is a single-stranded RNA virus; the viral
genome is approximately 9.5 kb. HCV RNA is positive sense and has a unique open
reading frame which encodes a single polyprotein (Clarke. 1997. J. Gen. Virol.
78:2397). The polyprotein is proteolytically processed to yield the mature viral proteins,
which include: nucleocapsid, envelope 1, envelope 2, metalloprotease, serine protease,
RNA helicase, cofactor, and RNA polymerase.

HCV is a major human pathogen. The virus was found to be the cause of most
cases of hepatitis which could not be ascribed to hepatitis A, hepatitis B, or hepatitis
delta virus (Clarke, supra). Over fifty percent of patients with hepatitis C virus (HCV)
become chronic carriers of the virus; there may be as many as 500 million chronic
carriers worldwide (Dhillon and Dusheiko. 1995. Histopathology 26:297). Persistent
infection with the virus causes chronic hepatitis and may ultimately lead to cirrhosis
and/or cancer (Kuo et al. 1989. Science 244:362). Current therapies for HCV are
ineffective; consequently, there is a need for new approaches to treat HCV infection.

Summary 

The present invention is an important advance in the battle against hepatitis C.
The novel peptides of the invention, which are not encoded by the standard polyprotein
HCV reading frame, have been shown to elicit an immune response in patients infected
with HCV and, thus, are produced during HCV infection. Accordingly, the invention
provides novel HCV polypeptides which are not derived from the HCV polyprotein and
methods of their use.



In one aspect, the invention pertains to an isolated or recombinant polypeptide or
fragment thereof encoded by a nucleic acid molecule derived from a hepatitis C virus,
which polypeptide has at least one of the following characteristics:

1) at least a portion of the polypeptide is encoded by a reading frame +1 or +2
relative to the standard hepatitis C virus open reading frame;

2) at least a portion of the polypeptide is encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon;

3) at least a portion of the polypeptide comprises an amino acid sequence at least
60% identical to the amino acid sequence shown in SEQ ID NO:2; and

4) at least a portion of the polypeptide comprises an amino acid sequence
encoded by a nucleic acid molecule which hybridizes under high stringency to the
nucleotide sequence shown in SEQ ID NO:1.

In certain embodiments of the invention the novel HCV polypeptides or portions
thereof of claim 1 are at least about 8 amino acids to at least about 100 amino acids in
length. In other embodiments, the polypeptides or portions thereof are at least about 14
amino acids to at least about 30 amino acids in length.

In one embodiment, the novel HCV polypeptides or portions thereof are encoded
by a reading frame +1 or +2 to the standard hepatitis C reading frame. In preferred
embodiments, the polypeptides are encoded by a reading frame corresponding to the
reading frame of SEQ ID NO:1 in which the first nucleotide of SEQ ID NO:1 is the first
nucleotide of a codon. In other preferred embodiments, the polypeptides are encoded, in
part, by the nucleic acid molecule of SEQ ID NO:1 and cause an immune response in a
subject.

In other embodiments, the novel HCV polypeptides comprise an amino acid
sequence at least 60% identical to the amino acid sequence shown in SEQ ID NO:2 and
cause an immune response in a subject. In preferred embodiments, the novel HCV
polypeptides comprise an amino acid sequence at least 90% identical to the amino acid
sequence shown in SEQ ID NO:2 and cause an immune response in a subject. In other
preferred embodiments, the novel HCV polypeptides comprise an amino acid sequence
shown in SEQ ID NO:2 and cause an immune response in a subject.

In other embodiments, the novel HCV polypeptides comprise an amino acid
sequence encoded by a nucleic acid molecule which hybridizes under high stringency to
the nucleotide sequence shown in SEQ ID NO:1.

In other embodiments, the novel HCV polypeptides comprise at least a portion of
an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8 and cause an
immune response in a subject.

In still other embodiments, the invention pertains to isolated or recombinant
polypeptides comprising an amino acid sequence selected from the group consisting of:
LNLKEKP(X1)(X2)TPT(X3) and AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K,
X2 is V or E, X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V. In yet other
embodiments, the novel HCV polypeptides consist of an amino acid sequence selected
from the group consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC.

In another aspect, the invention pertains to a vaccine composition for preventing
hepatitis C infection in a subject. In one embodiment, such a vaccine comprises a novel
HCV polypeptide. In another embodiment, such a vaccine comprises a nucleic acid
encoding a novel HCV polypeptide.

In another aspect, the invention pertains to an antibody which binds to a novel 
HCV polypeptide. 

In yet another aspect, the invention pertains to a kit for detecting a hepatitis C
infection. In one embodiment, such a kit comprises a novel HCV polypeptide. In
another aspect, the kit comprises an antibody which binds to a novel HCV polypeptide.

In yet another aspect, the invention pertains to a method of preventing HCV
infection by administering a novel HCV polypeptide to a subject or by causing said
polypeptide to be synthesized in a subject prior to HCV infection such that HCV
infection is prevented.

The invention also pertains to methods of diagnosing HCV infection. In one
embodiment, the method comprises detecting the presence or absence of antibodies
which react with a novel HCV polypeptide in the body fluid of a subject, wherein the
presence of antibodies which bind the polypeptide is indicative of an infection with
HCV. In another embodiment, the method comprises detecting the presence or absence
of a novel HCV polypeptide in the body fluid or tissue of a subject, wherein the presence
of an HCV polypeptide is indicative of an infection with HCV.

In still another aspect, the invention pertains to a method for identifying a
compound which interacts with a novel HCV polypeptide by contacting the polypeptide
with a compound in a cell-free system under conditions which allow interaction of the
compound with the polypeptide such that a complex is formed; separating the
compounds which do not form complexes with an HCV polypeptide from those which
do form complexes with an HCV polypeptide; and isolating and identifying the
compounds which form complexes with an HCV polypeptide to identify a compound
which interacts with a novel HCV polypeptide.

Detailed Description 

The present invention is an important step forward in preventing Hepatitis C
(HCV) infection, in treating ongoing infection, and in improving existing diagnostic
techniques. The invention is based, in part, on the identification of novel polypeptides
encoded by the Hepatitis C viral genome. These novel polypeptides are not encoded by
the standard HCV polyprotein reading frame.

Before further description of the invention, certain terms employed in the 
specification, examples and appended claims are, for convenience, collected here. 

I. Definitions

As used herein, the language "isolated or recombinant polypeptide" includes a
polypeptide which is substantially free of cellular material or culture medium when
produced by recombinant DNA techniques, or chemical precursors or other chemicals
when chemically synthesized.

As used herein, the term "polypeptide or fragment thereof" includes full-length
polypeptide molecules (from the first amino acid of translation initiation to the last
amino acid prior to translation termination) and peptide portions of such molecules.
Preferably the novel HCV polypeptides or fragments thereof are at least about 8 amino
acids to at least about 100 amino acids in length. More preferably the polypeptides or
fragments thereof are at least about 14 amino acids to at least about 30 amino acids in
length. In preferred embodiments, the novel HCV polypeptides of the invention
comprise an amino acid sequence which is conserved among different HCV isolates.
Such a conserved sequence can readily be determined using an alignment such as that
provided in Table 1. In other embodiments, the novel HCV polypeptides comprise a
portion of a novel HCV amino acid sequence which is distal (carboxy terminal) to a stop
codon in the +1 reading frame (relative to the main ORF) that includes the "UG" of the
"AUG" that is the initiator codon of the main ORF. In a more preferred embodiment,
the polypeptides or fragments thereof cause an immune response in a subject.

As used herein, the language "nucleic acid molecule" is intended to include DNA
molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., genomic viral RNA
or mRNA). The nucleic acid molecule may be single-stranded or double-stranded.

As used herein, the language "the standard hepatitis C virus open reading frame"
is the open reading frame (ORF) of the viral RNA which encodes the well-known HCV
polyprotein. The standard ORF represents the largest ORF in the viral genome. In the
infectious clone (GenBank accession number AF011751) the standard ORF uses
nucleotide 342 as the first nucleotide of a codon and continues until nucleotide 9377. In
different HCV isolates, the nucleotide which is a first nucleotide of a codon of the
standard ORF may be at a slightly different position. The nucleotide which is a first
nucleotide of a codon for any isolate can easily be obtained to yield the standard HCV
ORF. For example, in the case of known isolates, GenBank (or another database
containing the nucleotide sequence information for the isolate) can be accessed and the
coding sequence (CDS) information can be obtained. Alternatively, to determine the
standard ORF of a known or a new isolate, the nucleic acid sequence of the known or
new isolate can be aligned with a known sequence to give the highest homology (e.g.,
using a program such as BLAST). An exemplary BLAST search can be done, e.g.,
using the sequence found in GenBank accession number AF011751 as the query
sequence. In this search, nucleotides 342-940 of AF011751 were used to search the
non-redundant sequence database. The ORF of other HCV isolates which corresponds to the
standard HCV ORF of AF011751 (in which the initiation codon is at position 342,
which is read as position 1 of the query sequence) can be read from the BLAST
alignment. For example, the corresponding first nucleotide of a codon for GenBank
accession no. HPCCGAA is 342. Another way to find the standard ORF would be to
use a program, such as EditSeq (DNASTAR), which is designed to identify ORFs, using
the AUG aligned with position 342 of AF011751 as the start codon.

The term "percent (%) identity" as used in the context of nucleotide and amino
acid sequences (e.g., when one amino acid sequence is said to be X% identical to another
amino acid sequence) refers to the percentage of identical residues shared between the
two sequences, when optimally aligned. To determine the percent identity of two
nucleotide or amino acid sequences, the sequences are aligned for optimal comparison
purposes (e.g., gaps may be introduced in one sequence for optimal alignment with the
other sequence). The residues at corresponding positions are then compared and when a
position in one sequence is occupied by the same residue as the corresponding position
in the other sequence, then the molecules are identical at that position. The percent
identity between two sequences, therefore, is a function of the number of identical
positions shared by two sequences (i.e., % identity = # of identical positions/total # of
positions × 100).
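
The formula above can be illustrated with a short Python sketch. The function name percent_identity, the "-" gap symbol, and the choice to count every alignment column (including gap columns) as a position are assumptions made for illustration; the text does not specify how gap columns are counted.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: the denominator is the full alignment length, so columns
    containing a gap count as non-identical positions.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    # Count columns where both sequences carry the same residue (not a gap).
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return identical / len(aligned_a) * 100


# Two 13-residue peptides that differ at three positions (10 of 13 identical).
print(round(percent_identity("LNLKEKPNVTPTA", "LNLKEKPKETPTV"), 1))
```

The function expects sequences that have already been aligned; producing the alignment is a separate step.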

Computer algorithms known in the art can be used to optimally align and
compare two nucleotide or amino acid sequences to define the percent identity between
the two sequences. A preferred, non-limiting example of a mathematical algorithm
utilized for the comparison of two sequences is the algorithm of Karlin and Altschul
(1990) Proc. Natl. Acad. Sci. USA 87:2264-68, modified as in Karlin and Altschul
(1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such an algorithm is incorporated into
the NBLAST and XBLAST programs of Altschul, et al. (1990) J. Mol. Biol. 215:403-10.
To obtain gapped alignments for comparison purposes, Gapped BLAST can be
utilized as described in Altschul et al., (1997) Nucleic Acids Research 25(17):3389-3402.
When utilizing BLAST and Gapped BLAST programs, the default parameters of
the respective programs (e.g., XBLAST and NBLAST) can be used. See
http://www.ncbi.nlm.nih.gov.

Another preferred, non-limiting example of a mathematical algorithm utilized for
the comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989).
Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of
the GCG sequence alignment software package. When utilizing the ALIGN program for
comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty
of 12, and a gap penalty of 4 can be used. If multiple programs are used to compare
sequences, the program that provides optimal alignment (i.e., the highest percent identity
between the two sequences) is used for comparison purposes.
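
To make the idea of an optimal global alignment concrete, the following Python sketch implements a basic Needleman-Wunsch alignment. It is not the Myers and Miller linear-space algorithm and does not use the PAM120 table; the function name global_align and the simple match, mismatch, and linear gap scores are assumptions chosen only for illustration.

```python
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -2):
    """Needleman-Wunsch global alignment with a linear gap penalty.

    Returns (aligned_a, aligned_b, score). Scores are illustrative
    assumptions, not the PAM120 parameters mentioned in the text.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    # First row and column: aligning a prefix against nothing costs gaps.
    for i in range(1, rows):
        score[i][0] = i * gap
    for j in range(1, cols):
        score[0][j] = j * gap
    # Fill the table with the best of diagonal, up, and left moves.
    for i in range(1, rows):
        for j in range(1, cols):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,
                score[i - 1][j] + gap,
                score[i][j - 1] + gap,
            )
    # Trace back from the bottom-right corner to recover one best alignment.
    out_a, out_b = [], []
    i, j = len(a), len(b)
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[-1][-1]


aligned_a, aligned_b, best = global_align("ACGT", "AGT")
print(aligned_a)
print(aligned_b)
print(best)
```

The aligned strings returned here are the kind of input from which a percent identity can then be computed column by column.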

As used herein, the language "+1 or +2 relative to the standard hepatitis C virus
open reading frame" includes reading frames in which a first nucleotide of a codon is
shifted +1 nucleotide relative to the standard ORF or +2 nucleotides relative to the
standard ORF. The reading frames encoding the novel polypeptides do not necessarily
contain an in-frame start codon.
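
Reading one nucleotide sequence in the standard, +1, and +2 frames can be sketched in Python as follows. The function name translate, the toy sequence, and the use of the standard genetic code table are illustrative assumptions; the sequence shown is not an HCV sequence.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, orf_start: int, shift: int = 0) -> str:
    """Translate seq in a frame shifted `shift` nucleotides from the standard ORF.

    orf_start is the 1-based position of the first nucleotide of a codon in
    the standard ORF. Stop codons are written as "*"; unknown codons as "X".
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA input
    begin = orf_start - 1 + shift
    codons = (seq[i:i + 3] for i in range(begin, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


toy = "AUGGCCAAAUAG"  # hypothetical 12-nucleotide example
for shift in (0, 1, 2):
    print(shift, translate(toy, orf_start=1, shift=shift))
```

No start codon is required by this function, which mirrors the statement that the alternate reading frames do not necessarily contain an in-frame start codon.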

As used herein, the language "the reading frame of SEQ ID NO:1" means that the
first three nucleotides of the sequence shown in SEQ ID NO:1 are the first, second, and
third nucleotides of a codon for translation into an amino acid of a polypeptide. The
reading frame of SEQ ID NO:1 is +1 relative to the standard HCV ORF. The language
"a reading frame corresponding to the reading frame of SEQ ID NO:1" means that when
a sequence from an HCV isolate other than the AF011751 isolate shown in SEQ ID
NO:1 is aligned with the sequence of SEQ ID NO:1 to give the highest homology, e.g.,
using the BLAST program, it is then read in the same reading frame as SEQ ID NO:1 to
give the reading frame corresponding to the reading frame of SEQ ID NO:1. The
nucleotide position of a first nucleotide of a codon of an HCV isolate which corresponds
to that of SEQ ID NO:1 may vary from isolate to isolate. An exemplary BLAST search
illustrating this principle is provided as Appendix B. For example, for GenBank
accession number AF009606 a first nucleotide of a codon in a reading frame which
corresponds to the reading frame of SEQ ID NO:1 is nucleotide 346.

As used herein, the term "hybridizes under high stringency" is intended to
describe conditions for hybridization and washing under which nucleotide sequences at
least 70% homologous to each other typically remain hybridized to each other.
Preferably, the conditions are such that sequences at least 75%, 85%, or 95% identical
to each other typically remain hybridized to each other. Such stringent conditions are
known to those skilled in the art and can be found in Current Protocols in Molecular
Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting
example of stringent hybridization conditions is hybridization in 6X sodium
chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes in 0.2X
SSC, 0.1% SDS at 50-65°C.

As used herein, the term "antibody" is intended to include immunoglobulin
molecules and immunologically active portions of immunoglobulin molecules, i.e.,
molecules that contain an antigen binding site which specifically binds (immunoreacts
with) an antigen, such as Fab and F(ab')2 fragments. The terms "monoclonal antibodies"
and "monoclonal antibody composition", as used herein, refer to a population of
antibody molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope of an antigen, whereas the terms "polyclonal
antibodies" and "polyclonal antibody composition" refer to a population of antibody
molecules that contain multiple species of antigen binding sites capable of interacting
with a particular antigen. A monoclonal antibody composition thus typically displays a
single binding affinity for a particular antigen with which it immunoreacts.

As used herein, the term "adjuvant" includes agents which potentiate the immune
response to an antigen. Adjuvants can be administered in conjunction with the subject
polypeptides to additionally augment the immune response.

As used herein, the term "enhancing an immune response" includes increasing T
and/or B cell responses, i.e., cellular and/or humoral immune responses, by treatment of
a subject using the claimed methods. In one embodiment, the claimed methods can be
used to enhance T helper cell responses. In another embodiment, the claimed methods
can be used to enhance cytotoxic T cell responses. The claimed methods can be used to
enhance both primary and secondary immune responses. Preferably, the immune
response is increased as compared to the response of immune cells to the antigen in the
absence of treatment with the claimed methods. The immune response of a subject can
be determined by, for example, assaying antibody production, immune cell proliferation,
the release of cytokines, the expression of cell surface markers, cytotoxicity, enhanced
ability to clear infection with HCV, etc.

II. Novel HCV Polypeptides

The novel HCV polypeptides of the invention are not derived from an HCV
polyprotein, i.e., the polypeptides of the present invention are not encoded by the
standard HCV ORF. These alternate reading frame polypeptides are translated from (or
synthesized based on) a reading frame which is +1 or +2 to the standard HCV ORF. The
position of the first nucleotide of an ORF in which these polypeptides are translated will
vary slightly depending upon the isolate studied. For example, for the infectious clone
(GenBank accession number AF011751) the first nucleotide of the ORF in which the
novel HCV polypeptides are translated is nucleotide 346, which is +5 relative to the
standard HCV ORF. The first nucleotide of a codon of other, known or new isolates
which results in a reading frame which corresponds to the reading frame of SEQ ID
NO:1 can be determined, e.g., by performing a BLAST search using the nucleic acid
sequence of SEQ ID NO:1 as the query sequence as described above.
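
The frame arithmetic behind these positions can be sketched in Python. The helper name frame_offset is hypothetical; positions 342 and 346 are those given in the text for AF011751. One reading of the "+5" above is that, counting nucleotide 342 as position 1, nucleotide 346 is position 5, i.e. 4 nucleotides downstream, which places it in the +1 frame (4 mod 3 = 1).

```python
def frame_offset(codon_first_nt: int, standard_orf_start: int) -> int:
    """Frame (0, 1, or 2) of a codon start relative to the standard ORF.

    Both arguments are 1-based nucleotide positions in the same sequence.
    0 means in frame with the standard ORF; 1 and 2 mean +1 and +2.
    """
    return (codon_first_nt - standard_orf_start) % 3


# Positions taken from the text for GenBank accession AF011751.
print(frame_offset(346, 342))
```

For another isolate, the same calculation would use the standard ORF start and the corresponding codon position read from an alignment against SEQ ID NO:1.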

Translation of the novel HCV polypeptides of the invention does not necessarily
have to begin at a start AUG codon. For example, previous work has shown that the
start AUG of HCV could be mutated to AUU or CUG with little effect on translation
efficiency (Clarke, supra). Alternatively, RNA editing may be involved in generating an
initiator codon. Translation of the novel HCV polypeptides may also begin at the
initiation site of the standard HCV ORF with a frame shift into a different reading frame.
Finally, translation may be initiated 5' of the AUG start codon of the standard ORF in
any of the three reading frames, but shifted into the +1 reading frame (relative to the
standard ORF) so as to yield production of peptides which are, in part, at least 60%
identical to a portion of SEQ ID NO:2. The internal ribosome entry site (IRES) is a
complex RNA structural element that includes part of the 5' untranslated region of HCV
RNA and part of the adjacent coding region. It may induce frame shifting or
translational bypassing.

In preferred embodiments, the polypeptides are encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon. This reading frame can encode a
polypeptide of at least about 126 amino acids in length before a termination codon is
reached. Table 1 presents an alignment of novel HCV polypeptides which are encoded
in this reading frame from various HCV isolates, along with a majority sequence derived
using the Clustal method of sequence alignment. Stop codons appear in certain of the
isolates after amino acid 126. However, translation may proceed beyond these stop
codons. For example, in certain cases, these stop codons may be sequencing errors.
Alternatively, readthrough can occur by mutation, altered transcription, RNA editing,
frame shifting or ribosome slippage. Therefore, even in the polypeptides in which a stop
codon appears, in certain embodiments of the invention, the novel HCV polypeptide
may be longer, i.e., translation may proceed past a termination codon. Therefore, in the
case of, e.g., the infectious clone AF011751, translation of the polypeptide could
terminate, for example, at position 163 or 186 of SEQ ID NO:2. When the novel HCV
polypeptides of the invention are synthesized, these stop codons may be ignored.
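
Locating in-frame termination codons, so that a synthesized polypeptide can be designed to end at one or to continue past it, can be sketched in Python as below. The function name stop_codon_positions and the toy sequence are illustrative assumptions; the sequence is not taken from SEQ ID NO:1.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def stop_codon_positions(seq: str, start: int = 0) -> list[int]:
    """Return 1-based codon indices of stop codons in the chosen frame.

    start is the 0-based index of the first nucleotide of the first codon.
    """
    seq = seq.upper().replace("U", "T")  # accept RNA or DNA input
    return [
        index
        for index, i in enumerate(range(start, len(seq) - 2, 3), start=1)
        if seq[i:i + 3] in STOP_CODONS
    ]


toy = "ATGTAAGGGTGACCC"  # hypothetical example with two in-frame stops
print(stop_codon_positions(toy))           # frame beginning at index 0
print(stop_codon_positions(toy, start=1))  # frame shifted by one nucleotide
```

A codon index returned here corresponds to the amino acid position at which translation would terminate if the stop codon were not read through.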

In other embodiments, the polypeptides of the invention have some percentage
identity to the sequence shown in SEQ ID NO:2. The percent identity between two
nucleic acid or amino acid sequences can easily be calculated by dividing the number of
identical bases or amino acids by the total number of bases or amino acids. Sequences
are aligned to give the highest percent identity and yet provide an alignment which is
biologically meaningful. Sequences can be aligned manually or, preferably, using an
algorithm. For example, in the case of amino acid sequences, a FASTA search can be
performed of the Swiss Protein database using the Blosum50.cmp scoring matrix. The
gap creation penalty can be set, e.g., at 12 and the extension penalty can be set, e.g., at 2.
The joining threshold can be set, e.g., at 36; the optimization threshold can be set, e.g., at
24; and the optimization width can be set, e.g., at 16.

In certain embodiments, the novel HCV polypeptides comprise an amino acid
sequence at least about 40-50% identical to the amino acid sequence shown in SEQ ID
NO:2. In preferred embodiments, the novel HCV polypeptides comprise an amino acid
sequence at least about 50-60% identical to the amino acid sequence shown in SEQ ID
NO:2. In other preferred embodiments, the novel HCV polypeptides comprise an amino
acid sequence at least about 60-70% identical to the amino acid sequence shown in SEQ
ID NO:2. In more preferred embodiments, the novel HCV polypeptides comprise an
amino acid sequence at least about 70-80% identical to the amino acid sequence shown
in SEQ ID NO:2. In still more preferred embodiments, the novel HCV polypeptides
comprise an amino acid sequence at least about 80-90% identical to the amino acid
sequence shown in SEQ ID NO:2. In another preferred embodiment, the novel HCV
polypeptides comprise an amino acid sequence shown in SEQ ID NO:2.

In preferred embodiments, the polypeptides of the invention have the described
percent identity over a length of at least about 10 amino acids. In more preferred
embodiments, the percent identity of the polypeptides extends over a length of at least
about 20-30 amino acids. In a more preferred embodiment, the percent identity of the
polypeptides extends over a length of at least about 30-40 amino acids. In a more
preferred embodiment, the percent identity of the polypeptides extends over a length of
at least about 40-50 amino acids. In another more preferred embodiment, the percent
identity of the polypeptides extends over a length of more than 50 amino acids. In other
preferred embodiments, the percent identity of the polypeptides extends over a length of
more than 100 amino acids.

In other embodiments, the novel HCV polypeptides comprise an amino acid
sequence encoded by a nucleic acid molecule having some percentage identity to the
nucleic acid molecule shown in SEQ ID NO:1. In certain embodiments, the novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule at
least 70% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded by the
reading frame shown in SEQ ID NO:1. In preferred embodiments, the novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule at
least 80% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded by the
reading frame shown in SEQ ID NO:1. In more preferred embodiments, the novel HCV
polypeptides comprise an amino acid sequence encoded by a nucleic acid molecule at
least 90% identical to that shown in SEQ ID NO:1 in which the polypeptide is encoded by the
reading frame shown in SEQ ID NO:1. In other embodiments, novel HCV polypeptides
comprise an amino acid sequence encoded by a nucleic acid molecule shown in SEQ ID
NO:1 in which the polypeptide is encoded by the reading frame shown in SEQ ID NO:1.

In certain embodiments, the novel HCV polypeptides of the invention are
encoded by a nucleic acid molecule which hybridizes under stringent conditions to the
nucleic acid sequence shown in SEQ ID NO:1. Stringent hybridization conditions are
known in the art. In preferred embodiments, such polypeptides are encoded by a nucleic
acid molecule which hybridizes under stringent conditions to a nucleic acid molecule
from any HCV isolate, but which are read in or synthesized as if read in the reading
frame of SEQ ID NO:1.

In other embodiments, the novel HCV polypeptides comprise at least a portion of
an amino acid sequence selected from the group consisting of SEQ ID NO:3, SEQ ID
NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8 and cause an
immune response in a subject. Other novel HCV polypeptides can be identified using an
HCV nucleic acid sequence and determining the amino acids which are encoded in the
+1 or +2 reading frame. Polypeptides comprising these sequences can be made and
assayed for reactivity with antibodies from infected subjects. Those polypeptides which
bind to antibodies, i.e., have elicited an immune response in infected subjects, are made
by the virus during the course of infection and represent preferred novel HCV
polypeptides.

In still other embodiments, the invention pertains to isolated or recombinant
polypeptides comprising an amino acid sequence selected from the group consisting of:
LNLKEKP(X1)(X2)TPT(X3) and AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K,
X2 is V or E, X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V. In yet other
embodiments, the novel HCV polypeptides consist of an amino acid sequence selected
from the group consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC.
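
The two degenerate sequences above each define a small family of peptides, which can be enumerated with a short Python sketch. The names PATTERN_1, PATTERN_2, and expand are illustrative assumptions. Note that the two specific peptides listed in the text carry an additional C-terminal cysteine that is not part of the degenerate patterns.

```python
from itertools import product

# Fixed segments are strings; variable positions are tuples of alternatives.
PATTERN_1 = ["LNLKEKP", ("N", "K"), ("V", "E"), "TPT", ("A", "V")]
PATTERN_2 = ["AAHRT", ("L", "S"), "SSR", ("A", "V"), ("A", "V"), "VR"]


def expand(pattern):
    """Return every concrete peptide sequence described by a pattern."""
    options = [(part,) if isinstance(part, str) else part for part in pattern]
    return ["".join(choice) for choice in product(*options)]


family_1 = expand(PATTERN_1)
family_2 = expand(PATTERN_2)
print(len(family_1), len(family_2))

# The listed peptides, without their final cysteine, belong to the families.
print("LNLKEKPNVTPTAC"[:-1] in family_1)
print("AAHRTSSSRAVVRC"[:-1] in family_2)
```

Each pattern has three variable positions with two alternatives apiece, so each expands to eight sequences.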

In certain embodiments of the invention the novel HCV polypeptides of the
invention are at least about 8 amino acids to at least about 100 amino acids in length. In
other embodiments, the polypeptides of the invention are at least about 10 amino acids
to at least about 50 amino acids in length. In other embodiments, the polypeptides of the
invention are at least about 14 to at least about 25 amino acids in length.

In preferred embodiments, the novel HCV polypeptides are of a length sufficient
to cause an immune response in a subject. Such an immune response can be measured
using techniques which are known in the art. For example, the immune response elicited
by the HCV polypeptides of the invention can be a T cell-mediated response which can
be measured by, e.g., cytokine production and/or cellular proliferation or cellular
cytotoxicity and/or a B cell-mediated response which can be measured, e.g., by antibody
production.

In certain embodiments the novel HCV polypeptides of the invention are made as
fusion proteins. In addition to utilizing fusion proteins to enhance immunogenicity,
fusion proteins can also facilitate the expression of proteins, including the novel HCV
polypeptides of the present invention. For example, a novel HCV polypeptide can be
generated as a glutathione-S-transferase (GST) fusion protein. Such GST-fusion
proteins can enable easy purification of the novel HCV polypeptide, as for example by
the use of glutathione-derivatized matrices (see, for example, Current Protocols in
Molecular Biology, eds. Ausubel et al. (N.Y.: John Wiley & Sons, 1991)). In another
embodiment, a fusion gene coding for a purification leader sequence, such as a
poly-(His)/enterokinase cleavage site sequence, can be fused to a novel HCV
polypeptide, in order to permit purification of the poly(His)-tagged novel HCV polypeptide
by affinity chromatography using a Ni2+ metal resin. The purification leader sequence
can then be subsequently removed by treatment with enterokinase (e.g., see Hochuli et
al. (1987) J. Chromatography 411:177; and Janknecht et al. PNAS 88:8972).

Techniques for making fusion genes are known to those skilled in the art.
Essentially, the joining of various DNA fragments coding for different polypeptide
sequences is performed in accordance with conventional techniques, employing
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase
treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment,
the fusion gene can be synthesized by conventional techniques including automated
DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried
out using anchor primers which give rise to complementary overhangs between two
consecutive gene fragments which can subsequently be annealed to generate a chimeric
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel
et al. John Wiley & Sons: 1992).

It will be understood that the preceding characteristics of HCV polypeptides are
not mutually exclusive.

III. Production of Novel HCV Polypeptides

Novel HCV polypeptides can be produced by recombinant DNA techniques. For
example, a nucleic acid molecule encoding such a polypeptide is cloned into an
expression vector, the expression vector is introduced into a host cell and the novel HCV
polypeptide is expressed in the host cell. The novel HCV polypeptide can then be



wo 99/63941 PCT/US99/12929 

-14- 

isolated from the cells by an appropriate purification scheme using standard protein 
purification techniques. As an alternative to recombinant expression, a novel HCV 
polypeptide can be synthesized chemically using standard peptide synthesis techniques 
or purchased commercially. Moreover, native novel HCV polypeptides can be isolated 
5 from cells (e.g., cultured human cells infected with HCV), for example using an 
antibody. 

A. Recombinant Production of Novel HCV Polypeptides 

In certain embodiments, the novel HCV polypeptides are encoded by a
naturally-occurring HCV nucleic acid molecule. As used herein, a "naturally-occurring" nucleic
acid molecule refers to an RNA molecule (or a DNA molecule derived therefrom) 
having a nucleotide sequence that occurs in nature (e.g., encodes a protein produced by a 
naturally occurring HCV isolate). 

In addition to naturally-occurring isolates of the novel HCV polypeptides, the
skilled artisan will further appreciate that changes may be introduced by mutation, e.g.,
into an HCV nucleotide sequence thereby leading to changes in the amino acid sequence 
of the encoded HCV polypeptides. 

For example, an isolated nucleic acid molecule encoding a novel HCV
polypeptide homologous to the polypeptide of SEQ ID NO: 2, i.e., having a certain
percentage identity to the polypeptide of SEQ ID NO: 2, can be created by introducing
one or more nucleotide substitutions, additions or deletions into the nucleotide sequence
of SEQ ID NO: 1 such that one or more amino acid substitutions, additions or deletions
are introduced into the encoded polypeptide. Mutations can be introduced into SEQ ID
NO: 1 by standard techniques, such as site-directed mutagenesis and PCR-mediated
mutagenesis. Alternatively, such a polypeptide can be chemically synthesized to yield a
polypeptide with a change in amino acid sequence from that in the naturally occurring
polypeptide.

Preferably, either no substitutions or only conservative amino acid substitutions are
made where there is high homology or identity in amino acid residues among the various
isolates. A "conservative amino acid substitution" is one in which the amino acid
residue is replaced with an amino acid residue having a similar side chain. Families of
amino acid residues having similar side chains have been defined in the art, including
basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, 
threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). 
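
The side-chain families listed above can be written down as a small lookup table. The following is a minimal sketch, assuming standard one-letter amino acid codes; the names SIDE_CHAIN_FAMILIES, shared_families and is_conservative are hypothetical and are introduced only for illustration. A substitution is treated here as conservative when the two residues share at least one of the listed families.

```python
# Side-chain families exactly as listed in the text, in one-letter codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),              # lysine, arginine, histidine
    "acidic": set("DE"),              # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "nonpolar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def shared_families(a, b):
    """Return the sorted names of every family containing both residues."""
    a, b = a.upper(), b.upper()
    return sorted(
        name
        for name, members in SIDE_CHAIN_FAMILIES.items()
        if a in members and b in members
    )


def is_conservative(original, replacement):
    """True if two different residues share at least one side-chain family."""
    if original.upper() == replacement.upper():
        return False  # no change is not a substitution
    return bool(shared_families(original, replacement))
```

Because some residues belong to more than one family (e.g., valine is both nonpolar and beta-branched), the sketch reports all shared families rather than choosing one.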

Alternatively, in another embodiment, mutations can be introduced randomly 
along all or part of a novel HCV polypeptide coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened, e.g., by testing for reactivity 
with antibodies from an individual with a past or present HCV infection. 
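
As a rough illustration of the size of such a mutant pool, the sketch below enumerates every single-residue substitution of a peptide. It is a protein-level simplification under stated assumptions: saturation mutagenesis in practice acts on the coding sequence, the antibody screening step is experimental and is not modeled, and the function name single_substitution_variants is hypothetical. The example peptide is one of the preferred sequences given elsewhere in this specification.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def single_substitution_variants(peptide):
    """Yield (position, original, replacement, variant) for each single change.

    Positions are 1-based, following the usual convention for protein sequences.
    """
    for i, original in enumerate(peptide):
        for replacement in AMINO_ACIDS:
            if replacement != original:
                variant = peptide[:i] + replacement + peptide[i + 1:]
                yield i + 1, original, replacement, variant


variants = list(single_substitution_variants("LNLKEKPNVTPTAC"))
print(len(variants))  # 14 positions x 19 alternatives = 266
```

Each variant in the list would correspond to one candidate to be screened, e.g., for reactivity with patient antibodies.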



B. Expression Vectors and Host Cells 

The nucleic acid molecules described herein can be expressed in an expression 
vector to produce a novel HCV polypeptide. As used herein, the term "vector" refers to
a nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded 
DNA loop into which additional DNA segments may be ligated. Another type of vector 
is a viral vector, wherein additional DNA segments may be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which 
they are introduced (e.g., bacterial vectors having a bacterial origin of replication and 
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors are 
capable of directing the expression of genes to which they are operatively linked. Such 
vectors are referred to herein as "expression vectors". In general, expression vectors of 
utility in recombinant DNA techniques are often in the form of plasmids. In the present 
specification, "plasmid" and "vector" may be used interchangeably as the plasmid is the 
most commonly used form of vector. However, the invention is intended to include such 
other forms of expression vectors, such as viral vectors (e.g., replication defective
retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent
functions. 

The recombinant expression vectors of the invention comprise a nucleic acid
molecule as described herein in a form suitable for expression of the nucleic acid in a
host cell, which means that the recombinant expression vectors include one or more
regulatory sequences, selected on the basis of the host cells to be used for expression,
which are operatively linked to the nucleic acid sequence to be expressed. Within a
recombinant expression vector, "operatively linked" is intended to mean that the
nucleotide sequence of interest is linked to the regulatory sequence(s) in a manner which
allows for expression of the nucleotide sequence (e.g., in an in vitro
transcription/translation system or in a host cell when the vector is introduced into the
host cell). The term "regulatory sequence" is intended to include promoters, enhancers
and other expression control elements (e.g., polyadenylation signals). Such regulatory
sequences are described, for example, in Goeddel, Gene Expression Technology: Methods
in Enzymology 185, Academic Press, San Diego, CA (1990). Regulatory sequences include
those which direct constitutive expression of a nucleotide sequence in many types of host
cell and those which direct expression of the nucleotide sequence only in certain host
cells (e.g., tissue-specific regulatory sequences). It will be appreciated by those
skilled in the art that the design of the expression vector may depend on such factors as
the choice of the host cell to be transformed, the level of expression of protein desired,
etc. The expression vectors of the invention can be introduced into host cells to thereby
produce novel HCV polypeptides, including fusion proteins comprising such polypeptides.

The recombinant expression vectors of the invention can be designed for
expression of novel HCV polypeptides in prokaryotic or eukaryotic cells. For example,
novel HCV polypeptides can be expressed in bacterial cells such as E. coli, insect cells
(using baculovirus expression vectors), yeast cells or mammalian cells. Suitable host
cells are discussed further in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the 
recombinant expression vector may be transcribed and translated in vitro, for example
using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out in E. coli with
vectors containing constitutive or inducible promoters directing the expression of either
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein;
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion
moiety and the recombinant protein to enable separation of the recombinant protein from
the fusion moiety subsequent to purification of the fusion protein. Such enzymes, and
their cognate recognition sequences, include Factor Xa, thrombin and enterokinase.
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, D.B.
and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA)
and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST),
maltose E binding protein, or protein A, respectively, to the target recombinant protein.

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene
Expression Technology: Methods in Enzymology 185, Academic Press, San Diego,
California (1990) 60-89). Target gene expression from the pTrc vector relies on host
RNA polymerase transcription from a hybrid trp-lac fusion promoter. Target gene
expression from the pET 11d vector relies on transcription from a T7 gn10-lac fusion
promoter mediated by a coexpressed viral RNA polymerase (T7 gn1). This viral
polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident λ
prophage harboring a T7 gn1 gene under the transcriptional control of the lacUV5
promoter.

One strategy to maximize recombinant protein expression in E. coli is to express
the protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an
expression vector so that the individual codons for each amino acid are those
preferentially utilized in E. coli (Wada et al., (1992) Nuc. Acids Res. 20:2111-2118).
Such alteration of nucleic acid sequences of the invention can be carried out by standard
DNA synthesis techniques.
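
The codon-replacement strategy can be sketched as a simple back-translation, in which each amino acid is written with one chosen codon. The table below is a partial, illustrative assumption and is not taken from this specification or from the cited reference; a real design would use a measured E. coli codon usage table covering all twenty amino acids. The names PREFERRED_CODON and back_translate are hypothetical.

```python
# Partial, illustrative codon choices (an assumption for this sketch only).
# Each codon does encode the listed amino acid in the standard genetic code.
PREFERRED_CODON = {
    "A": "GCG", "C": "TGC", "E": "GAA", "K": "AAA", "L": "CTG",
    "N": "AAC", "P": "CCG", "T": "ACC", "V": "GTG",
}


def back_translate(peptide, table=PREFERRED_CODON):
    """Write a peptide as DNA using one chosen codon per amino acid."""
    missing = sorted(set(peptide) - set(table))
    if missing:
        raise KeyError("no codon listed for: " + ", ".join(missing))
    return "".join(table[residue] for residue in peptide)


dna = back_translate("LNLKEKPNVTPTAC")
print(len(dna))  # 14 residues x 3 nucleotides = 42
```

The resulting sequence encodes the same peptide as any other synonymous coding sequence; only the codon choice differs, which is the point of the strategy.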

In another embodiment, the novel HCV polypeptide expression vector is a yeast
expression vector. Examples of vectors for expression in yeast S. cerevisiae include
pYepSec1 (Baldari et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz,
(1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123), and pYES2
(Invitrogen Corporation, San Diego, CA).

Alternatively, novel HCV polypeptides can be expressed in insect cells using
baculovirus expression vectors. Baculovirus vectors available for expression of proteins
in cultured insect cells (e.g., Sf9 cells) include the pAc series (Smith et al., (1983) Mol.
Cell Biol. 3:2156-2165) and the pVL series (Lucklow, V.A., and Summers, M.D., (1989)
Virology 170:31-39).

In yet another embodiment, a nucleic acid molecule encoding novel HCV
polypeptides of the invention is expressed in mammalian cells using a mammalian
expression vector. Examples of mammalian expression vectors include pCDM8 (Seed,
B., (1987) Nature 329:840) and pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195).
When used in mammalian cells, the expression vector's control functions are often
provided by viral regulatory elements. For example, commonly used promoters are
derived from polyoma, Adenovirus 2, cytomegalovirus and Simian Virus 40.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988)
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci. USA
86:5473-5477), pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916),
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No.
4,873,316 and European Application Publication No. 264,166).
Developmentally-regulated promoters are also encompassed, for example the murine hox promoters
(Kessel and Gruss (1990) Science 249:374-379) and the α-fetoprotein promoter (Campes
and Tilghman (1989) Genes Dev. 3:537-546).

A recombinant expression vector is introduced into a suitable host cell. The
terms "host cell" and "recombinant host cell" are used interchangeably herein. It is
understood that such terms refer not only to the particular subject cell but to the progeny
or potential progeny of such a cell. Because certain modifications may occur in
succeeding generations due to either mutation or environmental influences, such progeny
may not, in fact, be identical to the parent cell, but are still included within the scope of
the term as used herein.

A host cell may be any prokaryotic or eukaryotic cell. For example, novel HCV
polypeptides may be expressed in bacterial cells such as E. coli, insect cells, yeast or
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other
suitable host cells are known to those skilled in the art.

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation" and "transfection" are intended to refer to a variety of art-recognized
techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, including
calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated
transfection, lipofection, or electroporation. Suitable methods for transforming or
transfecting host cells can be found in Sambrook et al. (Molecular Cloning: A
Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and
other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Preferred
selectable markers include those which confer resistance to drugs, such as G418,
hygromycin and methotrexate. Nucleic acid encoding a selectable marker may be
introduced into a host cell on the same vector as that encoding the polypeptide or may be
introduced on a separate vector. Cells stably transfected with the introduced nucleic acid
can be identified by drug selection (e.g., cells that have incorporated the selectable
marker gene will survive, while the other cells die).

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) novel HCV polypeptides. Accordingly,
the invention further provides methods for producing novel HCV polypeptides using
these host cells. In one embodiment, the method comprises culturing the host cell of
the invention (into which a recombinant expression vector encoding novel HCV
polypeptides has been introduced) in a suitable medium until a novel HCV polypeptide
is produced. In another embodiment, the method further comprises isolating novel HCV
polypeptides from the medium or the host cell.

C. Chemical Synthesis of Novel HCV Polypeptides

The novel HCV polypeptides can be chemically synthesized as is well known in 
the art. Moreover, the peptide can be substituted and/or derivatized to optimize stability. 
The subject polypeptides can also be synthesized as branched polypeptides, particularly 
for vaccine applications as is known in the art (see, e.g., Peptides, edited by Bernd Gutte,
Academic Press, 1995, pp. 456-493).

IV. Antibodies Which React With Novel HCV Polypeptides 

In yet another aspect, the invention pertains to an antibody which binds to a
novel HCV polypeptide. A novel HCV polypeptide, or fragment thereof, can be used as
an immunogen to generate antibodies that bind such a polypeptide using standard
techniques for polyclonal and monoclonal antibody preparation. The invention provides
numerous antigenic peptide fragments of novel HCV polypeptides for use as
immunogens. Preferably, an antigenic peptide of such a polypeptide comprises at least 8
amino acid residues of the amino acid sequence shown in SEQ ID NO: 2 or the ARF #1
polypeptide or ARF #2 polypeptide consensus sequences. Preferably, the antigenic
peptide comprises at least 10 amino acid residues, more preferably at least 14 amino acid
residues, even more preferably at least 18 amino acid residues. Preferred polypeptides
comprise the ARF #1 consensus sequence LNLKEKP(X1)(X2)TPT(X3) or the ARF #2
consensus sequence AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K, X2 is V or E,
X3 is A or V, X4 is L or S, X5 is A or V, and X6 is A or V. Other preferred HCV
polypeptides comprise or consist of the sequence LNLKEKPNVTPTAC or
AAHRTSSSRAVVRC.
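
The two consensus sequences each contain three variable positions with two permitted residues, so each describes eight concrete peptides. The following minimal sketch enumerates them and checks whether a given peptide contains one; the data layout and the names ARF1_CONSENSUS, ARF2_CONSENSUS, expand_consensus and contains_consensus are hypothetical and chosen only for illustration.

```python
from itertools import product

# Each consensus is a list of segments; a segment with several entries is a
# variable position (the first, second and third variable position in order).
ARF1_CONSENSUS = [("LNLKEKP",), ("N", "K"), ("V", "E"), ("TPT",), ("A", "V")]
ARF2_CONSENSUS = [("AAHRT",), ("L", "S"), ("SSR",), ("A", "V"), ("A", "V"), ("VR",)]


def expand_consensus(consensus):
    """Return every concrete sequence permitted by the consensus."""
    return ["".join(parts) for parts in product(*consensus)]


def contains_consensus(peptide, consensus):
    """True if the peptide contains any sequence permitted by the consensus."""
    return any(seq in peptide for seq in expand_consensus(consensus))


print(len(expand_consensus(ARF1_CONSENSUS)))  # 2 x 2 x 2 = 8
print(contains_consensus("LNLKEKPNVTPTAC", ARF1_CONSENSUS))  # True
print(contains_consensus("AAHRTSSSRAVVRC", ARF2_CONSENSUS))  # True
```

Under this reading, the two specific 14-residue peptides named above are each one consensus variant followed by a C-terminal cysteine.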

The subject HCV polypeptides are used to prepare antibodies by immunizing a
suitable subject (e.g., rabbit, goat, mouse or other mammal) with the immunogen. An
appropriate immunogenic preparation can contain, for example, recombinantly expressed
novel HCV polypeptides or a chemically synthesized novel HCV polypeptide. The
preparation can further include an adjuvant, such as Freund's complete or
incomplete adjuvant, or similar immunostimulatory agent. Immunization of a suitable
subject with an immunogenic HCV polypeptide preparation induces a polyclonal
anti-HCV polypeptide antibody response.

Accordingly, another aspect of the invention pertains to antibodies which react
with the novel HCV polypeptides. The term "antibody" as used herein refers to
immunoglobulin molecules and immunologically active portions of immunoglobulin
molecules, i.e., molecules that contain an antigen binding site which specifically binds
(immunoreacts with) an HCV polypeptide. The invention provides polyclonal and
monoclonal antibodies that bind HCV polypeptides. The term "monoclonal antibody" or
"monoclonal antibody composition", as used herein, refers to a population of antibody
molecules that contain only one species of an antigen binding site capable of
immunoreacting with a particular epitope of a novel HCV polypeptide. A monoclonal
antibody composition thus typically displays a single binding affinity for a particular
HCV polypeptide with which it reacts.

Polyclonal anti-HCV polypeptide antibodies can be prepared as described above
by immunizing a suitable subject with an HCV polypeptide immunogen or attenuated
HCV virus, or can be obtained from an infected individual. The anti-HCV polypeptide
antibody titer in the immunized subject can be monitored over time by standard
techniques, such as with an enzyme linked immunosorbent assay (ELISA) using
immobilized HCV polypeptide. If desired, the antibody molecules directed against HCV
polypeptide can be isolated from the animal (e.g., from the blood) and further purified by
well known techniques, such as protein A chromatography to obtain the IgG fraction. At
an appropriate time after immunization, e.g., when the antibody titers are highest,
antibody-producing cells can be obtained from the subject and used to prepare
monoclonal antibodies by standard techniques, such as the hybridoma technique
originally described by Kohler and Milstein (1975, Nature 256:495-497) (see also,
Brown et al. (1981) J. Immunol. 127:539-46; Brown et al. (1980) J. Biol. Chem.
255:4980-83; Yeh et al. (1976) PNAS 76:2927-31; and Yeh et al. (1982) Int. J. Cancer
29:269-75), the more recent human B cell hybridoma technique (Kozbor et al. (1983)
Immunol. Today 4:72), the EBV-hybridoma technique (Cole et al. (1985), Monoclonal
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96) or trioma techniques. The
technology for producing monoclonal antibody hybridomas is well known (see generally
R. H. Kenneth, in Monoclonal Antibodies: A New Dimension In Biological Analyses,
Plenum Publishing Corp., New York, New York (1980); E. A. Lerner (1981) Yale J.
Biol. Med. 54:387-402; M. L. Gefter et al. (1977) Somatic Cell Genet. 3:231-36).
Briefly, an immortal cell line (typically a myeloma) is fused to lymphocytes (typically
splenocytes) from a mammal immunized with an HCV polypeptide immunogen as
described above, and the culture supernatants of the resulting hybridoma cells are
screened to identify a hybridoma producing a monoclonal antibody that binds the HCV
polypeptide.

Any of the many well known protocols used for fusing lymphocytes and
immortalized cell lines can be applied for the purpose of generating an anti-HCV
polypeptide monoclonal antibody (see, e.g., G. Galfre et al. (1977) Nature 266:550-52;
Gefter et al. Somatic Cell Genet., cited supra; Lerner, Yale J. Biol. Med., cited supra;
Kenneth, Monoclonal Antibodies, cited supra). Moreover, the ordinary skilled worker
will appreciate that there are many variations of such methods which also would be
useful. Typically, the immortal cell line (e.g., a myeloma cell line) is derived from the
same mammalian species as the lymphocytes. For example, murine hybridomas can be
made by fusing lymphocytes from a mouse immunized with an immunogenic
preparation of the present invention with an immortalized mouse cell line. Preferred
immortal cell lines are mouse myeloma cell lines that are sensitive to culture medium
containing hypoxanthine, aminopterin and thymidine ("HAT medium"). Any of a
number of myeloma cell lines may be used as a fusion partner according to standard
techniques, e.g., the P3-NS1/1-Ag4-1, P3-x63-Ag8.653 or Sp2/O-Ag14 myeloma lines.
These myeloma lines are available from the American Type Culture Collection (ATCC),
Rockville, Md. Typically, HAT-sensitive mouse myeloma cells are fused to mouse
splenocytes using polyethylene glycol ("PEG"). Hybridoma cells resulting from the
fusion are then selected using HAT medium, which kills unfused and unproductively
fused myeloma cells (unfused splenocytes die after several days because they are not
transformed).

Hybridoma cells producing a monoclonal antibody of the invention are detected
by screening the hybridoma culture supernatants for antibodies that bind HCV
polypeptides, e.g., using a standard ELISA assay.

As an alternative to preparing monoclonal antibody-secreting hybridomas, a
monoclonal anti-HCV polypeptide antibody can be identified and isolated by screening a
recombinant combinatorial immunoglobulin library (e.g., an antibody phage display
library) with HCV polypeptides to thereby isolate immunoglobulin library members that
bind HCV polypeptides. Kits for generating and screening phage display libraries are
commercially available (e.g., the Pharmacia Recombinant Phage Antibody System,
Catalog No. 27-9400-01; and the Stratagene SurfZAP™ Phage Display Kit, Catalog
No. 240612). Additionally, examples of methods and reagents particularly amenable for
use in generating and screening antibody display libraries can be found in, for example,
Ladner et al. U.S. Patent No. 5,223,409; Kang et al. International Publication No. WO
92/18619; Dower et al. International Publication No. WO 91/17271; Winter et al.
International Publication WO 92/20791; Markland et al. International Publication No.
WO 92/15679; Breitling et al. International Publication WO 93/01288; McCafferty et al.
International Publication No. WO 92/01047; Garrard et al. International Publication No.
WO 92/09690; Ladner et al. International Publication No. WO 90/02809; Fuchs et al.
(1991) Bio/Technology 9:1370-1372; Hay et al. (1992) Hum Antibod Hybridomas 3:81-85;
Huse et al. (1989) Science 246:1275-1281; Griffiths et al. (1993) EMBO J 12:725-734;
Hawkins et al. (1992) J Mol Biol 226:889-896; Clarkson et al. (1991) Nature
352:624-628; Gram et al. (1992) PNAS 89:3576-3580; Garrad et al. (1991)
Bio/Technology 9:1373-1377; Hoogenboom et al. (1991) Nuc Acid Res 19:4133-4137;
Barbas et al. (1991) PNAS 88:7978-7982; and McCafferty et al. Nature (1990)
348:552-554.

Additionally, recombinant anti-HCV polypeptide antibodies, such as chimeric
and humanized monoclonal antibodies, comprising both human and non-human
portions, which can be made using standard recombinant DNA techniques, are within
the scope of the invention. Such chimeric and humanized monoclonal antibodies can be
produced by recombinant DNA techniques known in the art, for example using methods
described in Robinson et al. International Patent Publication PCT/US86/02269; Akira et
al. European Patent Application 184,187; Taniguchi, M., European Patent Application
171,496; Morrison et al. European Patent Application 173,494; Neuberger et al. PCT
Application WO 86/01533; Cabilly et al. U.S. Patent No. 4,816,567; Cabilly et al.
European Patent Application 125,023; Better et al. (1988) Science 240:1041-1043; Liu
et al. (1987) PNAS 84:3439-3443; Liu et al. (1987) J. Immunol. 139:3521-3526; Sun et
al. (1987) PNAS 84:214-218; Nishimura et al. (1987) Canc. Res. 47:999-1005; Wood et
al. (1985) Nature 314:446-449; Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553-1559;
Morrison, S. L. (1985) Science 229:1202-1207; Oi et al. (1986) BioTechniques
4:214; Winter U.S. Patent 5,225,539; Jones et al. (1986) Nature 321:552-525;
Verhoeyan et al. (1988) Science 239:1534; and Beidler et al. (1988) J. Immunol.
141:4053-4060.

An anti-HCV polypeptide antibody (e.g., monoclonal antibody) can be used to
isolate or detect HCV polypeptides by standard techniques, such as affinity
chromatography, immunoprecipitation, ELISA, or RIA as is well known in the art. An
anti-HCV polypeptide antibody can facilitate the purification of natural HCV
polypeptide from cells and of recombinantly produced HCV polypeptides expressed in
host cells. Moreover, an anti-HCV polypeptide antibody can be used to detect HCV
polypeptides from a body fluid of a subject which is suspected to have an HCV
infection. Detection may be facilitated by coupling (i.e., physically linking) the antibody
to a detectable substance. Examples of detectable substances include various enzymes,
prosthetic groups, fluorescent materials, luminescent materials and radioactive materials.
Examples of suitable enzymes include horseradish peroxidase, alkaline phosphatase,
β-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group complexes
include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine,
dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of a
luminescent material includes luminol; and examples of suitable radioactive material
include 125I, 131I, 35S or 3H.

Such antibodies can be incorporated in diagnostic kits and are also useful in
passive immunization against HCV in patients who have an active HCV infection or
are likely to be exposed to HCV.






V. Uses of Novel HCV Polypeptides

In another aspect, the invention pertains to a vaccine composition which is
administered to a subject prior to exposure to HCV to prevent hepatitis C infection in
the subject. In one embodiment, the vaccine comprises a novel HCV polypeptide of the
invention. In another embodiment, the vaccine causes a novel HCV polypeptide of the
invention to be synthesized in a subject.



A. Vaccines 

Novel HCV polypeptide sequences appropriate for use in vaccine compositions
for the prevention of HCV in a subject can easily be determined. For example, epitopes
which elicit an immune response can be identified by screening in an immunoassay
against sera from patients with a past or ongoing HCV infection. Alternatively,
immunogenic polypeptides can be identified by computer analysis to identify
immunogenic epitopes. Finally, the full-length novel polypeptide could be used in a
vaccine.

In another embodiment, agents which are known adjuvants can be administered
with the subject polypeptides. At this time, the only adjuvant widely used in humans
has been alum (aluminum phosphate or aluminum hydroxide). Saponin and its purified
component Quil A, Freund's complete adjuvant and other adjuvants used in research
and veterinary applications have potential use in human vaccines. However, new
chemically defined preparations can also be used, such as muramyl dipeptide,
monophosphoryl lipid A, phospholipid conjugates such as those described by
Goodman-Snitkoff et al. J. Immunol. 147:410-415 (1991), resorcinols, non-ionic
surfactants such as polyoxyethylene oleyl ether and n-hexadecyl polyethylene ether, and
enzyme inhibitors including pancreatic trypsin inhibitor, diisopropylfluorophosphate
(DFP) and trasylol. In embodiments in which antigen is administered, the antigen can,
e.g., be encapsulated within a proteoliposome as described by Miller et al., J. Exp. Med.
176:1739-1744 (1992) and incorporated by reference herein, or in lipid vesicles, such as
Novasome™ lipid vesicles (Micro Vescular Systems, Inc., Nashua, N.H.), to further
enhance immune responses.

In yet other embodiments, as an alternative to administering the novel HCV
polypeptide, the polypeptide can be synthesized by the subject. This can be done using a
plasmid DNA construct which is similar to those used for delivery of reporter or
therapeutic genes. Such a construct preferably comprises a bacterial origin of replication
that allows amplification of large quantities of the plasmid DNA; a prokaryotic
selectable marker gene; a nucleic acid sequence encoding a novel HCV polypeptide or
portion thereof; eukaryotic transcription regulatory elements to direct gene expression in
the host cell; and a polyadenylation sequence to ensure appropriate termination of the
expressed mRNA (Davis. 1997. Curr. Opin. Biotechnol. 8:635). Vectors used for DNA
immunization may optionally comprise a signal sequence (Michel et al. 1995. Proc.
Natl. Acad. Sci. USA. 92:5307; Donnelly et al. 1996. J. Infect. Dis. 173:314). DNA
vaccines can be administered by a variety of means, for example, by injection (e.g.,
intramuscular, intradermal, or the biolistic injection of DNA-coated gold particles into
the epidermis with a gene gun that uses a particle accelerator or a compressed gas to
inject the particles into the skin (Haynes et al. 1996. J. Biotechnol. 44:37)).
Alternatively, DNA vaccines can be administered by non-invasive means. For example,
pure or lipid-formulated DNA can be delivered to the respiratory system or targeted
elsewhere, e.g., Peyer's patches by oral delivery of DNA (Schubbert. 1997. Proc. Natl.
Acad. Sci. USA 94:961). Attenuated microorganisms can be used for delivery to
mucosal surfaces (Sizemore et al. 1995. Science. 270:29).

Any of the instant vaccine compositions can comprise (or encode) one or more
epitopes (either contiguous or non-contiguous) of a novel HCV polypeptide. Such
preparations can further comprise polypeptide sequences derived from an HCV
polyprotein sequence. In other embodiments, such a vaccine composition can further
comprise a compound which will enhance the immunological reactivity of the novel
HCV polypeptide epitope. For example, the immunogenicity of the novel HCV
polypeptides may be enhanced by making a fusion protein comprising a novel HCV
polypeptide fused to a different polypeptide, i.e., not a novel HCV polypeptide.
Techniques for making such fusion proteins are known in the art. Alternatively, a
vaccine can comprise an immunoregulatory molecule, such as a cytokine. For example,
in one embodiment, plasmids for DNA vaccination can express a single immunogen, or
two sequences can be coexpressed. In one embodiment, the additional sequences can be
additional immunogens (novel HCV polypeptides or HCV polyprotein polypeptides or
other polypeptides) or can encode modulators of immune responses such as lymphokine
genes or costimulatory molecules (Iwasaki et al. 1997. J. Immunol. 158:4591).

Typically, vaccine compositions of the present invention are prepared as
injectables, either as liquid solutions or suspensions. Solid forms suitable for solution
in, or suspension in, liquid prior to injection may also be prepared. The composition
may also be emulsified, or the polypeptide encapsulated into liposomes. The
polypeptide may be mixed with pharmaceutically acceptable excipients, for example,
water, saline, dextrose, glycerol, ethanol, or the like. The composition may also
comprise minor amounts of, for example, wetting agents, pH buffering agents and/or
adjuvants, such as aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine 
(thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637 ornor-MDP), 
25 N-acetylmuramlyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2'-dipalmitoyl-sn-glycerol- 

3-hydroxyphosphoryloxy)ethylamine (CGP 19835 A, or MTP-PE), or bacterial 
components. 

Such vaccine compositions are generally administered parenterally, by injection,
usually either subcutaneously or intramuscularly. Other formulations may be
administered orally, by inhalation or as suppositories.

The polypeptides may be incorporated into the vaccine in a neutral or salt form.
Pharmaceutically acceptable salts include the acid addition salts (formed with free amino
groups of the polypeptide) which are formed with inorganic acids, such as, for
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic,
tartaric, maleic, or the like. Salts formed with the free carboxyl groups may also be
derived from inorganic bases such as, for example, sodium, potassium, ammonium,
calcium, or ferric hydroxides, and such organic bases as isopropylamine,
trimethylamine, 2-ethylamino ethanol, histidine, procaine, or the like.

The vaccines are administered so as to be compatible with the dosage
formulation, and in such an amount as will be prophylactically and/or therapeutically
effective. The quantity to be administered depends on the subject to be treated, the
capacity of the subject's immune system to mount an immune response to the vaccine,
and the degree of protection desired. The range of 5 µg to 250 µg of antigen per dose,
however, is often appropriate. The vaccine compositions may be given in a single dose
or in multiple doses. Appropriate doses are well within the skill of the art to determine,
and do not constitute undue experimentation.

In still another embodiment, the invention pertains to a method of preventing 
HCV in a subject by administering a novel HCV polypeptide to a subject or by causing a 
novel HCV polypeptide to be expressed in a subject. 

B. Diagnostic Kits 

In another aspect of the invention, methods for diagnosing HCV infection, e.g.,
either a past or present infection, and diagnostic kits for detecting an
infection with HCV are provided.

In one embodiment, the invention provides a method of diagnosing HCV
infection by detecting the presence or absence of antibodies in the body fluid of a subject
which bind to a novel HCV polypeptide. In one embodiment the method comprises
incubating a test sample under conditions which allow the binding of a novel HCV
polypeptide and an antibody in the test sample of body fluid and detecting the binding of
polypeptide and antibody.

Test samples can be derived from any appropriate body fluid or tissue
preparation, for example, whole blood, plasma, serum, spinal fluid, lymph fluid, tears,
saliva, milk, or liver tissue preparations.

Detection of the binding between a novel HCV polypeptide of the
invention and an antibody can be accomplished using any technique which is known in
the art and can be facilitated using antibodies labelled as described above.

Antibodies which bind to a novel HCV polypeptide can be detected using a
number of different screening assays known in the art, such as an enzyme-linked
immunosorbent assay (ELISA), a radioimmunoassay (RIA), or a Western blot assay.
Each assay generally detects the presence of protein-antibody complexes of particular
interest by employing a labeled reagent (e.g., an antibody) specific for the complex of
interest. Accordingly, in the present invention, these assays are used to detect novel
HCV polypeptide-antibody complexes formed between immunoglobulins (e.g., human
IgG, IgM and IgA) contained in a biological sample and a novel HCV polypeptide. As
will be described below, these protein-antibody complexes are preferably detected using
an enzyme-linked antibody or antibody fragment (e.g., a monoclonal antibody or
fragment thereof) which recognizes and specifically binds to the polypeptide-antibody
complexes.

In one embodiment of the method, a sandwich ELISA assay is used.
For example, a novel HCV polypeptide, with or without conjugation to a carrier such as
activated BSA, is immobilized on a plate. A body fluid sample from an individual is
contacted with the novel HCV polypeptide under conditions which allow binding of the
antibodies in the sample to the polypeptide. The sample is then removed, and any
antibody which has bound to the HCV polypeptide is detected by contacting the plate
with a labeled secondary antibody or antibody fragment which binds to an antibody
which might be present in the subject's sample, e.g., an anti-human antibody. The
unbound secondary antibody is removed and the presence of secondary antibody which
remains bound is detected, e.g., using a label as described above. Possible controls for
use in the method include body fluids from uninfected subjects and polypeptides which
are not novel HCV polypeptides. In accordance with the present invention, the presence
of such an antibody is indicative of an infection with HCV.

In another embodiment of the assay the test sample can be tested for the
presence of novel HCV polypeptides using known antibodies. In these embodiments,
antibodies that bind to novel HCV polypeptides are used to detect the presence of novel
HCV polypeptides in the body fluid of a subject or in a cell of a subject. In performing
such an assay, the antibodies which bind to a novel HCV polypeptide are contacted with
a cell or body fluid of a subject under conditions where a novel HCV polypeptide in the
subject's sample can bind to the antibody. Unbound antibody is removed and bound
antibody is detected. Any of the antibodies described above can be used in practicing
this method. Preferred antibodies for use in the methods of this embodiment are highly
specific, including monospecific and, more preferably, monoclonal antibodies or
fragments thereof. In preferred embodiments, such antibodies are labelled. In other
embodiments, such antibodies are detected by employing a secondary antibody which
binds to them and not to the test sample from a subject. The presence of a novel HCV
polypeptide in the subject's sample is indicative of an infection with HCV.

In yet another aspect, the present invention provides an assay kit for
diagnosing HCV infection in a subject. Preferably, the kit contains a solid support (e.g.,
an ELISA plate) capable of adsorbing immunoglobulin (e.g., IgG, IgM and IgA) from a
subject's sample (preferably a human biological sample, such as a body fluid) and a
monoclonal antibody or fragment thereof specific for a novel HCV polypeptide. In
another embodiment, the solid support can be omitted from the kit. In another
embodiment, the kit contains a solid support (e.g., an ELISA plate or a slide) and a
monoclonal antibody or fragment thereof specific for a novel HCV polypeptide. In other
embodiments, the solid support can be omitted. The assay kit can optionally include
instructions, or additional reagents such as a solution for washing unbound proteins from
the solid support, and materials needed for performing a detection assay.



C. Targets for Therapeutic Intervention

The novel HCV polypeptides of the invention are also targets for anti-HCV
therapy. As such, the invention provides methods for identifying compounds which
interact with a novel HCV polypeptide and, thus, are likely to interfere with infection.
In one embodiment, the method involves contacting the polypeptide with a compound in
a cell-free system under conditions which allow interaction of the compound with the
polypeptide such that a complex is formed. The complexes of polypeptide and
compound can then be separated from the compounds which do not bind to the HCV
polypeptide, and the compounds which bind to HCV polypeptides can then be isolated and
identified.

Exemplary compounds which can be screened for activity in the subject assays
include, but are not limited to, peptides, nucleic acids, carbohydrates, small organic
molecules, and natural product extract libraries. The term "non-peptidic compound" is
intended to encompass compounds that are comprised, at least in part, of molecular
structures different from naturally-occurring L-amino acid residues linked by natural
peptide bonds. However, "non-peptidic compounds" are intended to include compounds
composed, in whole or in part, of peptidomimetic structures, such as D-amino acids,
non-naturally-occurring L-amino acids, modified peptide backbones and the like, as well
as compounds that are composed, in whole or in part, of molecular structures unrelated
to naturally-occurring L-amino acid residues linked by natural peptide bonds.
"Non-peptidic compounds" also are intended to include natural products.

A recent trend in medicinal chemistry includes the production of mixtures of
compounds, referred to as libraries. While the use of libraries of peptides is well
established in the art, new techniques have been developed which have allowed the
production of mixtures of other compounds, such as benzodiazepines (Bunin et al. 1992.
J. Am. Chem. Soc. 114:10987; DeWitt et al. 1993. Proc. Natl. Acad. Sci. USA 90:6909),
peptoids (Zuckermann. 1994. J. Med. Chem. 37:2678), oligocarbamates (Cho et al.
1993. Science 261:1303), and hydantoins (DeWitt et al. supra). Rebek et al. have
described an approach for the synthesis of molecular libraries of small organic molecules
with a diversity of 10^4-10^5 (Carell et al. 1994. Angew. Chem. Int. Ed. Engl. 33:2059;
Carell et al. 1994. Angew. Chem. Int. Ed. Engl. 33:2061).

The compounds of the present invention can be obtained using any of the
numerous approaches in combinatorial library methods known in the art, including:
biological libraries; spatially addressable parallel solid phase or solution phase libraries;
synthetic library methods requiring deconvolution; the 'one-bead one-compound' library
method; and synthetic library methods using affinity chromatography selection. The
biological library approach is limited to peptide libraries, while the other four
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries
of compounds (Lam, K.S. 1997. Anticancer Drug Des. 12:145).

In one embodiment, the test compound is a peptide or peptidomimetic. In
another, preferred embodiment, the compounds are small, organic non-peptidic
compounds.

Other exemplary methods for the synthesis of molecular libraries can be found in
the art, for example in: Erb et al. 1994. Proc. Natl. Acad. Sci. USA 91:11422; Horwell
et al. 1996. Immunopharmacology 33:68; and in Gallop et al. 1994. J. Med. Chem.
37:1233.

Libraries of compounds may be presented in solution (e.g., Houghten (1992)
Biotechniques 13:412-421), or on beads (Lam (1991) Nature 354:82-84), chips (Fodor
(1993) Nature 364:555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP
'409), plasmids (Cull et al. (1992) Proc. Natl. Acad. Sci. USA 89:1865-1869) or on phage
(Scott and Smith (1990) Science 249:386-390); (Devlin (1990) Science 249:404-406);
(Cwirla et al. (1990) Proc. Natl. Acad. Sci. 87:6378-6382); (Felici (1991) J. Mol. Biol.
222:301-310); (Ladner supra).

In many drug screening programs which test libraries of compounds and
natural extracts, high throughput assays are desirable in order to maximize the number of
compounds surveyed in a given period of time. Assays which are performed in cell-free
systems, such as may be derived with purified or semi-purified proteins, are often
preferred as "primary" screens in that they can be generated to permit rapid development
and relatively easy detection of an alteration in a molecular target which is mediated by a
test compound. Accordingly, in an exemplary screening assay of the present invention,
the compound of interest is contacted with a novel HCV polypeptide. Detection and
quantification of novel HCV polypeptide/compound complexes identifies the compound
as a potential modulator of a novel HCV polypeptide. The efficacy of the compound can
be assessed by generating dose response curves from data obtained using various
concentrations of the test compound. Moreover, a control, e.g., using a different
polypeptide, can also be performed to provide a baseline for comparison.

Complex formation between the novel HCV polypeptide and a compound
may be detected by a variety of techniques. For instance, modulation of the formation of
complexes can be quantitated using, for example, detectably labelled proteins such as
radiolabelled, fluorescently labelled, or enzymatically labelled novel HCV polypeptides,
by immunoassay, or by chromatographic detection.
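
The dose response assessment mentioned for the screening assay can be sketched numerically. The following Python sketch is illustrative only: it assumes a simple one-site binding model, which the text does not specify, and the function names, the dissociation constant, and the concentration series are all hypothetical. It simulates a signal across a concentration series and estimates the half-maximal concentration by log-linear interpolation.

```python
import math

def fraction_bound(conc, kd):
    """One-site binding model (an assumption, not taken from the text):
    fraction of polypeptide present as polypeptide/compound complex."""
    return conc / (kd + conc)

def estimate_ec50(concs, responses):
    """Estimate the concentration giving half of the highest observed response,
    interpolating linearly in log10(concentration) between bracketing points."""
    half = max(responses) / 2.0
    points = sorted(zip(concs, responses))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo <= half <= r_hi:
            if r_hi == r_lo:
                return c_lo
            t = (half - r_lo) / (r_hi - r_lo)
            log_c = math.log10(c_lo) + t * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_c
    return None  # half-maximal response not bracketed by the data

# Hypothetical ten-fold concentration series (arbitrary units) and simulated signal
concs = [0.01, 0.1, 1.0, 10.0, 100.0, 1000.0]
kd = 1.0  # hypothetical dissociation constant, same units as concs
responses = [fraction_bound(c, kd) for c in concs]
ec50 = estimate_ec50(concs, responses)
print(f"estimated half-maximal concentration: {ec50:.2f}")
```

With real assay data, the measured complex signal at each concentration would replace the simulated `responses` list, and the same series run against a control polypeptide would supply the baseline.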

Typically, it will be desirable to immobilize either the novel
HCV polypeptide or the compound or both to facilitate separation of compound/novel
HCV complexes from uncomplexed forms, as well as to accommodate automation of the
assay. In one embodiment, a fusion protein can be provided which adds a domain that
allows the polypeptide to be bound to a matrix. For example,
glutathione-S-transferase/receptor (GST/receptor) fusion protein forms of the novel
polypeptides can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St.
Louis, MO) or glutathione derivatized microtitre plates, which are then combined with
the test compound, which is bound to beads, and incubated under conditions conducive to
complex formation, e.g., at physiological conditions for salt and pH, though slightly
more stringent conditions may be desired, e.g., at 4°C in a buffer containing 0.6 M NaCl
or a detergent such as 0.1% Triton X-100. Following incubation, the uncomplexed
forms are removed by washing and compounds which bind to novel HCV polypeptides
are identified.

Other techniques for immobilizing proteins on matrices are also available
for use in the subject assay. For instance, the novel HCV polypeptides can be
immobilized utilizing conjugation of biotin and streptavidin. For instance, biotinylated
novel HCV polypeptides can be prepared from biotin-NHS (N-hydroxy-succinimide)
using techniques well known in the art (e.g., biotinylation kit, Pierce Chemicals,
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce
Chemical). Alternatively, antibodies reactive with the novel HCV polypeptides can be
derivatized to the wells of the plate, and the novel HCV polypeptides trapped in the
wells by antibody conjugation. As above, preparations of a novel HCV polypeptide and
a test compound are incubated in the wells of the plate, and the amount of novel HCV
polypeptide/compound complex trapped in the well can be quantitated.

Other exemplary methods for detecting such complexes, in addition to
those described above for the GST-immobilized complexes, include immunodetection of
complexes using antibodies reactive with the novel HCV polypeptide, or which are
reactive with the receptor protein and compete for binding with the novel HCV
polypeptide; as well as enzyme-linked assays which rely on detecting an enzymatic
activity associated with the novel HCV polypeptide. In the instance of the latter, the
enzyme can be chemically conjugated or provided as a fusion protein with the novel
HCV polypeptide. To illustrate, the novel HCV polypeptide can be chemically
cross-linked or genetically fused with alkaline phosphatase, and the amount of novel HCV
polypeptide trapped in the complex can be assessed with a chromogenic substrate of the
enzyme, e.g., paranitrophenylphosphate. Likewise, a fusion protein comprising the novel
HCV polypeptide and glutathione-S-transferase can be provided, and complex formation
quantitated by detecting the GST activity using 1-chloro-2,4-dinitrobenzene (Habig et al.
(1974) J Biol Chem 249:7130).

For processes which rely on immunodetection for quantitating one of the proteins
trapped in the complex, antibodies against the protein, such as the anti-novel HCV
antibodies described herein, can be used. Alternatively, the protein to be detected in the
complex can be "epitope tagged" in the form of a fusion protein which includes, in
addition to the novel HCV polypeptide, a second polypeptide for which antibodies are
readily available (e.g., from commercial sources). For instance, the GST fusion proteins
described above can also be used for quantification of binding using antibodies against
the GST moiety. Other useful epitope tags include myc-epitopes (e.g., see Ellison et al.
(1991) J Biol Chem 266:21150-21157), which include a 10-residue sequence from
c-myc, as well as the pFLAG system (International Biotechnologies, Inc.) or the
pEZZ-protein A system (Pharmacia, NJ).

The practice of the present invention will employ, unless otherwise indicated,
conventional techniques of cell biology, cell culture, molecular biology, microbiology,
recombinant DNA, and immunology, which are within the skill of the art. Such
techniques are explained fully in the literature. See, for example, Genetics; Molecular
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, J. et al. (Cold Spring Harbor
Laboratory Press (1989)); Short Protocols in Molecular Biology, 3rd Ed., ed. by
Ausubel, F. et al. (Wiley, NY (1995)); DNA Cloning, Volumes I and II (D. N. Glover
ed., 1985); Oligonucleotide Synthesis (M. J. Gait ed. (1984)); Mullis et al. U.S. Patent
No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1984));
the treatise, Methods In Enzymology (Academic Press, Inc., N.Y.); Immunochemical
Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press,
London (1987)); Handbook Of Experimental Immunology, Volumes I-IV (D. M. Weir
and C. C. Blackwell, eds. (1986)); and Miller, J. Experiments in Molecular Genetics
(Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1972)).

The contents of all references, pending patent applications and published patents
cited throughout this application are hereby expressly incorporated by reference.
Specifically, the contents of USSN 60/088,670, titled Novel Hepatitis C Virus Peptides
And Uses Thereof, filed on June 9, 1998, and 60/089,138, titled Novel Hepatitis C Virus
Peptides And Uses Thereof, filed on June 11, 1998, are incorporated herein by this
reference.

The invention is further illustrated by the following examples, which should not
be construed as further limiting.

Examples

Example 1. The detection of antibodies against novel HCV polypeptides.

Consensus polypeptides were synthesized based on the sequence homology
between novel HCV polypeptides shown in Table 1. The following peptides were made
using conventional techniques by Biosynthesis Corp. (Lewisville, TX):
LNLKEKPNVTPTAC and AAHRTSSSRAVVRC (alternate reading frame (ARF)
polypeptides 1 and 2, respectively). The following polypeptides derived from HCV
CORE protein were used as controls: PTDPRRRSRNLGKVIDTC and
GCATRKTSERSQPRGRRAPI. The peptides were conjugated to activated bovine
serum albumin giving up to 4 peptide molecules for each BSA molecule. Peptide-BSA
conjugates at a concentration of approximately 0.5 mg/ml were shipped on ice in 3 ml
of phosphate buffered saline, 0.1 M, pH 7.4. They were aliquoted and stored at -20°C
until use.
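
As an illustrative check, the synthesized ARF peptides can be compared against the consensus sequences recited in claim 12, in which the variable positions are X1 (N or K), X2 (V or E), X3 (A or V), X4 (L or S), X5 (A or V) and X6 (A or V). The Python sketch below encodes those positions as regular-expression character classes; the variable and function names are hypothetical. The synthesized peptides carry an additional C-terminal cysteine that the consensus patterns do not cover, so a substring search is used.

```python
import re

# Consensus patterns; each bracketed class is one variable position X1-X6
ARF1_CONSENSUS = re.compile(r"LNLKEKP[NK][VE]TPT[AV]")
ARF2_CONSENSUS = re.compile(r"AAHRT[LS]SSR[AV][AV]VR")

# Peptide sequences as listed in Example 1
PEPTIDES = {
    "ARF 1": "LNLKEKPNVTPTAC",
    "ARF 2": "AAHRTSSSRAVVRC",
    "CORE 1": "PTDPRRRSRNLGKVIDTC",
    "CORE 2": "GCATRKTSERSQPRGRRAPI",
}

def matching_consensus(sequence):
    """Return the names of the consensus patterns found within the sequence."""
    hits = []
    if ARF1_CONSENSUS.search(sequence):
        hits.append("ARF1 consensus")
    if ARF2_CONSENSUS.search(sequence):
        hits.append("ARF2 consensus")
    return hits

matches = {name: matching_consensus(seq) for name, seq in PEPTIDES.items()}
for name, hits in matches.items():
    print(name, "->", hits if hits else "no consensus match")
```

Each ARF peptide matches only its own consensus, and the two CORE control peptides match neither, which is consistent with their use as controls.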

ELISAs were performed as described by Kirkegaard and Perry Laboratories, Inc.
(Gaithersburg, MD). For use in the ELISA, the polypeptides were dissolved in 1X
"coating buffer" to give a peptide-BSA concentration of 1 µg/ml; 100 µl were added to
96 well microtiter plates (Falcon 3072, Becton Dickinson and Company, Franklin Lakes,
NJ) and incubated at room temperature for 1 hour or overnight at 4°C. The coating
solution was removed and plates were then blocked by adding 300 µl of 1% BSA in
phosphate buffered saline (as prepared by Kirkegaard and Perry), and incubating for 30
min. at room temperature. The blocking solution was removed and 100 µl of 1% BSA
(1X) was added.

Sera were obtained from patients who were known to have or to have previously
had an HCV infection. Serum samples were diluted 1:100 in 1X BSA and 100 µl was
added to the first well of each (yielding a total of 200 µl). Serum samples were then
serially diluted (two-fold at each step). Control wells contained BSA only. Plates were
incubated for 1 hour at room temperature with moderate agitation to allow binding.
Plates were washed 5 times with 1X wash solution (PBS and 0.02% Tween). After
washing, 100 µl of the secondary antibody was immediately added and allowed to react
for 1 hour at room temperature. The secondary antibody was either the Fab fragment of
anti-human IgG conjugated to horseradish peroxidase (HRP) at a dilution of 1:1000, or
anti-human IgG (in PBS with 1% BSA). Plates were washed 5 times with 1X washing
solution, and then 100 µl of hydrogen peroxide and TMB were added and allowed to
react for 10 to 30 minutes at room temperature. To stop the reaction, 100 µl of 1 M
phosphoric acid were added. O.D. measurements were obtained by using a dual
wavelength scanner: 450 nm values minus 650 nm values.
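
The dilution and read-out arithmetic of this protocol can be sketched as follows. This Python sketch rests on two assumptions not stated in the text: that the two-fold series starts from the first well (so that well holds serum at 1:200 after 100 µl of 1:100 serum is added to 100 µl of buffer), and that eight wells are used per series. The function names and the example OD readings are hypothetical.

```python
from fractions import Fraction

def well_dilutions(pre_dilution=Fraction(1, 100), sample_ul=100, diluent_ul=100, steps=8):
    """Serum dilution in each well of a two-fold series.
    The first well mixes sample_ul of pre-diluted serum with diluent_ul of buffer;
    each later well halves the concentration (steps is a hypothetical well count)."""
    first = pre_dilution * Fraction(sample_ul, sample_ul + diluent_ul)
    return [first / (2 ** i) for i in range(steps)]

def corrected_od(od_450, od_650):
    """Dual-wavelength reading: signal at 450 nm minus reference at 650 nm."""
    return od_450 - od_650

dilutions = well_dilutions()
labels = [f"1:{d.denominator}" for d in dilutions]
print(labels)

# Hypothetical raw readings for one well
print(round(corrected_od(1.25, 0.05), 3))
```

Subtracting the 650 nm reading removes signal unrelated to the TMB reaction product, such as scratches or fingerprints on the plate, which is the usual reason for a dual-wavelength read.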

Results of ELISA Tests for Antibodies of HCV-Specific Polypeptides

             Control Sera                  HCV Patient Sera
             Mean OD (S.D.)                Mean OD (S.D.)
             (N=2, tested in duplicate)    (N=6, tested in duplicate)

CORE #1      0.379 (0.063)                 0.851 (0.231)
CORE #2      0.775 (0.053)                 1.390 (0.400)
ARF #1       0.763 (0.133)                 2.286 (0.215)
ARF #2       0.401 (0.039)                 0.813 (0.813)
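
One simple way to read the ELISA results is to compare the mean OD of patient sera with that of control sera for each peptide. The Python sketch below does this with the values listed above; it assumes that the first group of four values belongs to the control sera and the second group to the HCV patient sera, in the row order CORE #1, CORE #2, ARF #1, ARF #2. The ratio is an illustrative summary only and is not an analysis performed in the original work.

```python
# (mean OD, S.D.) pairs; the column assignment is an assumption (see lead-in)
results = {
    "CORE #1": {"control": (0.379, 0.063), "patient": (0.851, 0.231)},
    "CORE #2": {"control": (0.775, 0.053), "patient": (1.390, 0.400)},
    "ARF #1": {"control": (0.763, 0.133), "patient": (2.286, 0.215)},
    "ARF #2": {"control": (0.401, 0.039), "patient": (0.813, 0.813)},
}

def signal_ratio(row):
    """Ratio of mean patient OD to mean control OD for one peptide."""
    return row["patient"][0] / row["control"][0]

ratios = {name: signal_ratio(row) for name, row in results.items()}
best = max(ratios, key=ratios.get)

for name, ratio in ratios.items():
    print(f"{name}: patient/control = {ratio:.2f}")
print("highest ratio:", best)
```

With only two control sera and six patient sera, these ratios are descriptive; the standard deviations in the table should be weighed before drawing conclusions about any single peptide.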



Example 2. Development of a western blot assay for detecting antibody production to 
alternate HCV reading frame proteins. 

Five micrograms of BSA-alternate reading frame peptide conjugates and
appropriate controls (quenched BSA, BSA-BSA peptide conjugates) were denatured
by incubating in standard Laemmli loading buffer (with beta-mercaptoethanol
and SDS), at 100°C for three minutes. Samples were then cooled and spun in a
microcentrifuge prior to loading on a discontinuous 1.5 mm thick SDS-PAGE gel
(4% stacking gel [pH 6.8] and 12.5% [pH 8.4] resolving gel). The running
buffer was TRIS-glycine/SDS (0.1%), with an unadjusted pH of approximately 8.4.

Samples were run through the stacking gel (1 cm) for approximately 45 mins at
90 volts. After the bromophenol blue (BB) entered the resolving gel, the voltage was
increased to 160 V. The gels were run for approximately 1.5 hrs, or until the
BB was 2/3 of the way through the gel. After stopping the run, the gels were
either stained with coomassie blue, or equilibrated for 15 minutes with the
transfer buffer (1X CAPS, pH 11.0, 10% MeOH). PVDF membranes (Immobilon-P,
0.45 micron pore size) were wetted in 100% MeOH for 1 min, then in 50:50
MeOH:double distilled water for 5 mins, and finally equilibrated with the transfer buffer.
The transfer was set up in transfer buffer (filter paper, gel, PVDF
membrane, and filter paper), and slipped into the BIO-RAD transfer tank. The
transfer was run at 20 volts, overnight (approximately 10 hrs), at 4°C, with stirring.

After transfer, the tank was disassembled, and the membrane was soaked in 100%
MeOH for one minute, then allowed to air-dry. In order to visualize the
efficiency of the transfer, the membranes (after re-wetting in MeOH then
water) were incubated with 1% Ponceau-S red stain, and rinsed in double distilled water.
After scanning the image, the dye was removed in a dilute NaOH solution (1 ml
saturated NaOH in 100 ml ddwater).

The membranes were then blocked in 3% NFDM (6 grams non-fat dried milk in
200 ml of 1X TBS [TRIS-buffered saline, pH 7.4]), for one hour. After
blocking, the membranes were rinsed in two washes of 1X TBS, 200 ml apiece.

The membranes were then incubated with the primary antibody solution: 4 ml of
a 1/200 dilution of patient sera in 1% NFDM in 1X TTBS (TBS with 0.025%
Tween-20). This incubation lasted one hour, at 30°C, in glass with slow rotation of the
tube or beaker. After this incubation, the membranes were washed three times
in 200 ml of 1X TTBS.

The secondary antibody solution was 200 ml of a 1/3000 dilution of the
BIO-RAD goat anti-human alkaline phosphatase conjugate, in 1% NFDM in 1X
TTBS. This incubation lasted 1 hr, at 30°C, with gentle shaking.
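
The reagent quantities in this protocol follow from simple dilution arithmetic, sketched below in Python. The helper names are hypothetical; the volumes, dilution factors and percentages are those given in the protocol, and percentages are treated as weight per volume.

```python
def stock_volume_ul(final_volume_ml, dilution_factor):
    """Volume of stock (in microliters) needed for a 1/dilution_factor
    dilution made up to final_volume_ml."""
    return final_volume_ml * 1000.0 / dilution_factor

def solute_grams(percent_w_v, volume_ml):
    """Grams of solute for a percent (w/v) solution of the given volume."""
    return percent_w_v * volume_ml / 100.0

serum_ul = stock_volume_ul(4, 200)          # primary antibody: 1/200 in 4 ml
conjugate_ul = stock_volume_ul(200, 3000)   # secondary antibody: 1/3000 in 200 ml
blocking_milk_g = solute_grams(3, 200)      # 3% NFDM blocking solution, 200 ml

print(f"patient serum: {serum_ul:.0f} ul in 4 ml")
print(f"AP conjugate: {conjugate_ul:.1f} ul in 200 ml")
print(f"non-fat dried milk for blocking: {blocking_milk_g:.0f} g in 200 ml")
```

The blocking figure reproduces the 6 grams per 200 ml stated in the protocol, which serves as a check on the percent (w/v) interpretation.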

After this incubation, the membranes were washed twice with 200 ml of
1X TTBS (5 mins apiece), and once with 200 ml of 1X TBS.

The bands were visualized with the BIO-RAD AP substrate kit (200 ml total),
with gentle, occasional shaking. After visualization, the membranes were
washed several times with ddwater, followed by one wash with MeOH, then
air-dried and photographed or scanned.

What is claimed is:

1. An isolated or recombinant polypeptide or fragment thereof encoded by a nucleic
acid molecule derived from a hepatitis C virus, having at least one of the following
characteristics:

1) at least a portion of the polypeptide is encoded by a reading frame +1 or +2
relative to the standard hepatitis C virus open reading frame;

2) at least a portion of the polypeptide is encoded by a reading frame
corresponding to the reading frame of SEQ ID NO:1 in which the first nucleotide of
SEQ ID NO:1 is the first nucleotide of a codon;

3) at least a portion of the polypeptide comprises an amino acid sequence at least
60% identical to the amino acid sequence shown in SEQ ID NO:2; and

4) at least a portion of the polypeptide comprises an amino acid sequence
encoded by a nucleic acid molecule which hybridizes under high stringency to the
nucleotide sequence shown in SEQ ID NO:1.

2. The polypeptide or portion thereof of claim 1, wherein said polypeptide is at least
about 8 amino acids to at least about 100 amino acids in length.

3. The polypeptide or portion thereof of claim 2, wherein said polypeptide is at least
about 14 amino acids to at least about 30 amino acids in length.

4. The polypeptide or portion thereof of claim 1, wherein said polypeptide is
encoded by a reading frame +1 or +2 to the standard hepatitis C reading frame.

5. The polypeptide or portion thereof of claim 1, wherein said polypeptide is
encoded by a reading frame corresponding to the reading frame of SEQ ID NO:1 in
which the first nucleotide of SEQ ID NO:1 is the first nucleotide of a codon.

6. The polypeptide or portion thereof of claim 5, wherein said polypeptide or
portion thereof is encoded by the nucleic acid molecule of SEQ ID NO:1 and causes an
immune response in a subject.



7. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence at least 60% identical to the amino acid sequence
shown in SEQ ID NO:2 and causes an immune response in a subject.

8. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence at least 90% identical to the amino acid sequence
shown in SEQ ID NO:2 and causes an immune response in a subject.

9. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence shown in SEQ ID NO:2 which polypeptide causes an
immune response in a subject.

10. The polypeptide or portion thereof of claim 1, wherein said polypeptide
comprises an amino acid sequence encoded by a nucleic acid molecule which hybridizes
under high stringency to the nucleotide sequence shown in SEQ ID NO:1.

11. The polypeptide or portion thereof of claim 1 which polypeptide comprises at
least a portion of an amino acid sequence selected from the group consisting of SEQ ID
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, and SEQ ID NO:8
and causes an immune response in a subject.

12. An isolated or recombinant polypeptide comprising an amino acid sequence
selected from the group consisting of: LNLKEKP(X1)(X2)TPT(X3) and
AAHRT(X4)SSR(X5)(X6)VR, wherein X1 is N or K, X2 is V or E, X3 is A or V, X4 is
L or S, X5 is A or V, and X6 is A or V.



30 
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13. A polypeptide consisting of an amino acid sequence selected from the group 
consisting of LNLKEKPNVTPTAC and AAHRTSSSRAVVRC. 

14. A vaccine composition for preventing hepatitis C infection in'a subject 
5 comprising the pol3T>eptide of claim 1 . 

15. A vaccine composition for preventing hepatitis C infection in a subject 
comprising the polypeptide of claim 2. 

10 16. A vaccine composition for preventing hepatitis C infection in a subject 
comprising the polypeptide of claim 4. 

1 7. A vaccine composition for preventing hepatitis C infection in a subject 
comprising the polypeptide of claim 7. 

15 

18. A vaccine composition for preventing hepatitis C infection in a subject 
comprising the polypeptide of claim 12. 

1 9. A vaccine composition for preventing hepatitis C infection in a subject 
20 comprising a nucleic acid encoding polypeptide of claim 1 . 

20. A vaccine composition for preventing hepatitis C infection in a subject 
comprising a nucleic acid encoding polypeptide of claim 2. 

21. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 4. 

22. A vaccine composition for preventing hepatitis C infection in a subject 
comprising a nucleic acid encoding polypeptide of claim 7. 

23. A vaccine composition for preventing hepatitis C infection in a subject
comprising a nucleic acid encoding polypeptide of claim 12.

24. An antibody which binds to a polypeptide of claim 1.

25. A kit for detecting a hepatitis C infection comprising the polypeptide of claim 1.

26. A kit for detecting a hepatitis C infection comprising an antibody to the
polypeptide of claim 1.

27. A method of preventing HCV infection comprising administering the
polypeptide of claim 1 to a subject or by causing said polypeptide to be synthesized in a
subject prior to HCV infection such that HCV infection is prevented.

28. A method of diagnosing HCV infection comprising detecting the presence or
absence of antibodies which react with the polypeptide of claim 1 in the body fluid of a 
subject, wherein the presence of antibodies which bind the polypeptide is indicative of 
an infection with HCV. 

29. A method of diagnosing HCV infection comprising detecting the presence or
absence of the polypeptide of claim 1 in the body fluid or tissue of a subject, wherein the 
presence of an HCV polypeptide is indicative of an infection with HCV. 



30. A method for identifying a compound which interacts with the polypeptide of
claim 1, comprising:

contacting said polypeptide with a compound in a cell-free system under
conditions which allow interaction of the compound with the polypeptide such that a
complex is formed;

separating the compounds which do not form complexes with an HCV
polypeptide from those which do form complexes with an HCV polypeptide; and

isolating and identifying the compounds which form complexes with an HCV
polypeptide.

SEQUENCE LISTING
<110> Andrea Branch, Jose Walewski, and Dechard Stump

<120> NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF

<130> RII-003PC

<140>
<141>

<160> 8

<170> PatentIn Ver. 2.0

<210> 1 

<211> 595 

<212> DNA 

<213> Hepatitis C virus

<400> 1 

gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg 60 

tcaagttccc gggtggcggt cagatcgttg gtggagttta cttgttgccg cgcaggggcc 120 

ctagattggg tgtgcgcgcg acgaggaaga cttccgagcg gtcgcaacct cgaggtagac 180

gtcagcctat ccccaaggca cgtcggcccg agggcaggac ctgggctcag cccgggtacc 240 

cttggcccct ctatggcaat gagggttgcg ggtgggcggg atggctcctg tctccccgtg 300 

gctctcggcc tagctggggc cccacagacc cccggcgtag gtcgcgcaat ttgggtaagg 360 

tcatcgatac ccttacgtgc ggcttcgccg acctcatggg gtacataccg ctcgtcggcg 420 

cccctcttgg aggcgctgcc agggccctgg cgcatggcgt ccgggttctg gaagacggcg 480
tgaactatgc aacagggaac cttcctggtt gctctttctc tatcttcctt ctggccctgc 540 

tctcttgcct gactgtgccc gcttcagcct accaagtgcg caattcctcg gggct 595 

<210> 2
<211> 196
<212> PRT
<213> Hepatitis C virus

<400> 2
Ala Arg Ile Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Val
1               5                   10                  15

Ala His Arg Thr Ser Ser Ser Arg Val Ala Val Arg Ser Leu Val Glu
            20                  25                  30

Phe Thr Cys Cys Arg Ala Gly Ala Leu Asp Trp Val Cys Ala Arg Arg
        35                  40                  45

Gly Arg Leu Pro Ser Gly Arg Asn Leu Glu Val Asp Val Ser Leu Ser
    50                  55                  60

Pro Arg His Val Gly Pro Arg Ala Gly Pro Gly Leu Ser Pro Gly Thr
65                  70                  75                  80

Leu Gly Pro Ser Met Ala Met Arg Val Ala Gly Gly Arg Asp Gly Ser
                85                  90                  95

Cys Leu Pro Val Ala Leu Gly Leu Ala Gly Ala Pro Gln Thr Pro Gly
            100                 105                 110

Val Gly Arg Ala Ile Trp Val Arg Ser Ser Ile Pro Leu Arg Ala Ala
        115                 120                 125

Ser Pro Thr Ser Trp Gly Thr Tyr Arg Ser Ser Ala Pro Leu Leu Glu
    130                 135                 140

Ala Ala Pro Gly Pro Trp Arg Met Ala Ser Gly Phe Trp Lys Thr Ala
145                 150                 155                 160

Thr Met Gln Gln Gly Thr Phe Leu Val Ala Leu Ser Leu Ser Ser Phe
                165                 170                 175

Trp Pro Cys Ser Leu Ala Leu Cys Pro Leu Gln Pro Thr Lys Cys Ala
            180                 185                 190

Ile Pro Arg Gly
        195
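
The relationship between SEQ ID NO: 1 and SEQ ID NO: 2 can be illustrated with a short translation sketch. It is only an illustration, not part of the listing: it assumes the standard genetic code, reads the first 60 nucleotides of SEQ ID NO: 1 in the frame that starts at nucleotide 1, and uses hypothetical helper names (`translate`, `to_three_letter`).

```python
# Standard genetic code, built from the conventional t/c/a/g codon ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter amino acid codes, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna: str, frame: int = 0) -> str:
    """Translate DNA in the given frame; stop codons are rendered as '*'."""
    seq = "".join(dna.split()).lower()[frame:]
    codons = (seq[i:i + 3] for i in range(0, len(seq) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


def to_three_letter(protein: str) -> str:
    """Render a one-letter protein string in three-letter codes."""
    return " ".join(THREE_LETTER[residue] for residue in protein)


# First 60 nucleotides of SEQ ID NO: 1, copied from the listing above.
SEQ1_FIRST_60 = (
    "gcacgaatcc taaacctcaa agaaaaacca aacgtaacac caaccgtcgc ccacaggacg"
)

FIRST_20 = translate(SEQ1_FIRST_60)
print(to_three_letter(FIRST_20))
```

Under these assumptions the output corresponds to residues 1 to 20 of SEQ ID NO: 2 as listed.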



<210> 3
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 8, X is N or K

<220>
<223> at location 9, X is V or E

<220>
<223> at location 13, X is A or V

<400> 3
Leu Asn Leu Lys Glu Lys Pro Xaa Xaa Thr Pro Thr Xaa
1               5                   10

<210> 4
<211> 13
<212> PRT
<213> Hepatitis C virus

<220>
<223> at location 6, X is L or S

<220>
<223> at location 10, X is A or V

<220>
<223> at location 11, X is A or V

<400> 4
Ala Ala His Arg Thr Xaa Ser Ser Arg Xaa Xaa Val Arg
1               5                   10

<210> 5
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 5
Leu Asn Leu Lys Glu Lys Pro Asn Val Thr Pro Thr Ala Cys
1               5                   10



<210> 6
<211> 14
<212> PRT
<213> Hepatitis C virus

<400> 6
Ala Ala His Arg Thr Ser Ser Ser Arg Ala Val Val Arg Cys
1               5                   10
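
The variable positions recited for SEQ ID NO: 3 and SEQ ID NO: 4 each define a small family of concrete peptides. The following sketch enumerates them; it is an illustration only, and the `expand` helper and the `{X1}`-style template notation are assumptions introduced here, not part of the listing.

```python
from itertools import product


def expand(template: str, choices: dict) -> list:
    """Return every concrete peptide for a template with {X1}-style slots."""
    names = sorted(choices)
    return [
        template.format(**dict(zip(names, combo)))
        for combo in product(*(choices[name] for name in names))
    ]


# SEQ ID NO: 3: location 8 is N or K, location 9 is V or E, location 13 is A or V.
SEQ3_VARIANTS = expand(
    "LNLKEKP{X1}{X2}TPT{X3}", {"X1": "NK", "X2": "VE", "X3": "AV"}
)

# SEQ ID NO: 4: location 6 is L or S, locations 10 and 11 are each A or V.
SEQ4_VARIANTS = expand(
    "AAHRT{X4}SSR{X5}{X6}VR", {"X4": "LS", "X5": "AV", "X6": "AV"}
)

for peptide in SEQ3_VARIANTS + SEQ4_VARIANTS:
    print(peptide)
```

Each formula has three two-way positions and so yields eight 13-residue peptides. One variant of each, followed by a C-terminal cysteine, gives the 14-residue sequences of SEQ ID NO: 5 and SEQ ID NO: 6.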



<210> 7
<211> 18
<212> PRT
<213> Hepatitis C virus

<400> 7
Pro Thr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly Lys Val Ile Asp
1               5                   10                  15

Thr Cys



<210> 8
<211> 20
<212> PRT
<213> Hepatitis C virus

<400> 8
Gly Cys Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly Arg
1               5                   10                  15

Arg Ala Pro Ile
            20



PCT

WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)

(51) International Patent Classification 6: C12Q 1/70, G01N 33/53 (A3)

(11) International Publication Number: WO 99/63941

(43) International Publication Date: 16 December 1999 (16.12.99)

(21) International Application Number: PCT/US99/12929

(22) International Filing Date: 9 June 1999 (09.06.99)

(30) Priority Data:
60/088,670, 9 June 1998 (09.06.98), US
60/089,138, 11 June 1998 (11.06.98), US

(63) Related by Continuation (CON) or Continuation-in-Part (CIP) to Earlier Applications:
US 60/088,670 (CIP), filed on 9 June 1998 (09.06.98)
US 60/089,138 (CIP), filed on 11 June 1998 (11.06.98)

(71)(72) Applicants and Inventors: BRANCH, Andrea, D. [US/US]; Apartment 6A, 923 5th Avenue, New York, NY 10021 (US). WALEWSKI, Jose, L. [US/US]; Apartment 7F, 50 E. 98th Street, New York, NY 10029 (US). STUMP, Dechard, D. [US/US]; Apartment 5R, 529 East 83rd Street, New York, NY 10028 (US).

(74) Agents: DECONTI, Giulio, A., Jr. et al.; Lahive & Cockfield, LLP, 28 State Street, Boston, MA 02109 (US).

(81) Designated States: AU, CA, JP, US, European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE).

Published
With international search report.
Before the expiration of the time limit for amending the claims and to be republished in the event of the receipt of amendments.

(88) Date of publication of the international search report: 16 March 2000 (16.03.00)

(54) Title: NOVEL HEPATITIS C VIRUS PEPTIDES AND USES THEREOF

(57) Abstract

Novel hepatitis C virus (HCV) polypeptides are provided which are not encoded by the standard HCV open reading frame. These alternate [illegible] are useful, inter alia, in vaccine compositions, in diagnosing HCV infection, and as therapeutic targets.




INTERNATIONAL SEARCH REPORT

International application No. PCT/US99/12929

A. CLASSIFICATION OF SUBJECT MATTER

IPC(6): C12Q 1/70; G01N 33/53
US CL: 435/5, 7.1, 320.1, 69.3; 424/204.1; 530/300, 350
According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols)
U.S.: 435/5, 7.1, 320.1, 69.3; 424/204.1; 530/300, 350

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
MEDLINE, WEST, DERWENT, EPOABS
search terms: HCV, open reading frame, +1, +2, ORF

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Citation of document, with indication, where appropriate, of the relevant passages:
US 5,350,671 A (HOUGHTON et al.) 27 September 1994, see entire document.

Relevant to claim No.: 1-30



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family



Date of the actual completion of the international search: 14 December 1999

Date of mailing of the international search report: 19 January 2000 (19.01.00)

Name and mailing address of the ISA/US: Commissioner of Patents and Trademarks


